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FOREWORD 


The  first,  in  the  series  of  annual  meetings  entitled  Army 
Conferences  on  Applied  Mathematics  and  Computing,  was  held  on 
9-11  May  1983  at  George  Washington  University,  Washington,  DC. 
These  meetings,  sponsored  by  the  Army  Mathematics  Steering 
Committee  (AMSC),  combines  two  symposia,  namely  the  Conferences 
of  Army  Mathematicians  and  the  Numerical  Analysis  and  Computers 
conferences.  The  joining  of  these  two  meetings  underscores  the 
growing  synergism  between  applicable  mathematics  and  computing. 

In  the  next  two  paragraphs  short  histories  of  the  two  forerunners 
of  these  new  conferences  are  presented. 

The  Office  of  Ordnance  Research,  now  the  Army  Research 
Office,  planned  a  series  of  symposia  called  Ordnance  Conferences 
of  Arsenal  Mathematicians.  The  first  of  these  meetings  was  held 
on  29  October  1954  at  Watertown  Arsenal .  The  purpose  of  these 
meetings  was  to  focus  the  attention  of  applied  mathematicians  on 
some  specific  ordnance  projects  needing  high  level  scientific 
knowledge.  Following  the  Seventh  Conference  of  Ordnance 
Mathematicians,  the  AMSC  became  the  sponsor  for  these  symposia. 
This  committee  requested  that  these  conferences  be  held  on  an 
Army-wide  basis,  and  that  their  name  be  changed  to  Conferences  of 
Army  Mathematicians.  This  name  continued  through  the 
twenty-eight  conference,  which  was  held  on  28-30  June  1982  at  the 
Uniformed  Services  University  of  Health  Sciences,  Bethesda, 
Maryland. 

About  five  years  after  the  first  Conference  of  Arsenal 
Mathematicians  was  held,  the  Office  of  Ordnance  Research 
organized  an  00R  Liason  Group  on  Computers.  Two  of  these 
meetings  were  held,  one  in  1959  and  the  other  in  1960,  to 
exchange  information  of  interest  to  managers  of  ordnance 
computers.  [These  gatherings  can  be  considered  as  the  forerunners 
of  the  Numerical  Analysis  and  Computers  Conferences.]  The  AMSC 
decided  that  these  meetings  should  be  conducted  on  an  Army-wide 
basis  and  be  entitled  the  ARO  Working  Groups  on  Computers.  Their 
purpose  was  to  provide  a  format  for  exchanging  ideas  on  the 
Army's  desires,  capabilities,  and  interest  in  the  field  of 
'other-than  business'  application  of  computers,  and  they  should 
provide  AMSC  and  ARO  with  information  on  the  Army's  need  for 
computers,  requirements  for  assistance  in  research  in  numerical 
analysis,  and  other  kinds  of  mathematics.  Two  meetings,  one  in 
1962  and  the  other  in  1964,  were  held  under  the  above  mentioned 
title.  Starting  in  1965  these  symposia  were  held,  except  for 
1973,  on  a  yearly  basis,  and  at  first  were  entitled  Army 
Numerical  Analysis  Conferences.  Starting  with  the  1975  meeting 
they  became  known  as  the  Numerical  Analysis  and  Computers 
Conferences.  The  last  meeting  in  this  series  was  held  on  3-4 
February  1982  at  the  U.  S.  Army  Engineer  Waterways  Experiment 
Station,  Vicksburg,  Mississippi. 
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Members  of  the  AMSC  would  like  to  thank  Professor  Nozer  D. 
Singpurwalla  of  George  Washington  University,  Washington,  DC,  for 
his  invitation  to  hold  the  first  meeting  of  this  new  series  of 
conferences  at  his  university.  They  would  also  like  to  thank  him 
for  providing  such  excellent  facilities  for  the  symposium,  and 
for  all  the  help  he  and  other  members  of  this  staff  provided  to 
insure  a  smooth  running  meeting. 

The  Program  Committee  was  especially  pleased  with  the  quality 
of  the  contributed  papers  for  this  meeting.  Also  the  number  of 
these  papers,  57,  helped  get  these  new  conferences  off  to  a  good 
start.  There  was  a  Special  Session  on  Distributed  Command  and 
Control.  The  speakers  in  this  session  covered  topics  of  special 
interest  to  various  Army  groups.  A  list  of  the  invited  speakers 
along  with  the  titles  of  their  addresses  is  noted  below.  Members 
of  the  Committee  are  sorry  to  report  that,  due  to  a  previous 
engagement.  Dr.  Karl  J.  Astrom  of  the  Lund  Institute  of  Control, 
Sweden,  was  unable  to  accept  our  invitation  to  address  this 
conference. 

Speaker  and  Affiliations  Ti tl e 

Professor  Julian  Cole  Pertubation  Techniques  for 

Rensselaer  Polytechnic  Institute  Fluid  Dynamics 

Professor  Andrew  Majda  Stability  and  Instability 

University  of  Cal i form* a-Berkeley  for  Shock  Waves 

Professor  Sanjoy  K.  Mitter  Adaptive  Controls 

Massachusetts  Institute  of 
Technology 

Dr.  M.  Yousuf f  Hussaini  Spectral  Methods  for  Partial 

NASA  Langley  Research  Center  Differential  Equations 

Professor  Carl  de  Boor  Multivariate  B-Splines 

Mathematics  Research  Center, 

University  of  Wisconsin-Madison 

Members  of  the  AMSC  would  like  to  thank  the  speakers  and  all 
the  other  individuals  who  contributed  to  the  success  of  this 
meeting.  They  have  requested  that  most  of  the  contributed  papers 
be  made  available  in  printed  form.  These  research  articles  will 
enable  many  persons  that  could  not  attend  the  smposium  to  profit 
by  these  contributions  to  the  scientific  literature. 
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ASYMPTOTICS  AND  NUMERICS 


Julian  D.  Cole 
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Rensselaer  Polytechnic  Institute 
Troy,  NY  12181 


ABSTRACT.  Limit  process  expansions  applied  to  problems  for 
partial  differential  equations  are  discussed.  Three  different 
types  of  non-uniformities,  non-linear  far  field,  non-linear 
local  singularity,  and  singular  boundary  conditions  are  illus¬ 
trated  by  example.  The  use  of  different  limits  to  construct 
overlapping  expansions  is  emphasized.  Matching  is  sketched 
brief ly . 


I.  INTRODUCTION.  The  combination  of  asymptotic  and  numer¬ 
ical  methods  provides  a  very  useful  approach  to  a  wide  variety 
of  problems  involving  partial  differential  equations.  In  many 
cases  an  "exact"  problem  can  be  formulated  which  is  non-linear. 
This  non-linear  problem  in  its  dimensionless  form  contains 
various  parameters.  When  a  parameter  is  effectively  small  (or 
large)  the  dependence  of  the  solution  on  the  parameter  can  be 
studied.  The  dependence  is  asymptotic  and  can  vet v  often  be 
connected  with  a  limit  process  in  terms  of  rescaled  coordinates 
and  similarity  parameters.  Very  often,  too,  different  expan¬ 
sions  are  needed  to  achieve  validity  in  different  regions. 
Expansions  in  adjacent  regions  of  physical  and  parameter  space 
can  be  connected  by  asymptotic  matching.  The  purpose  of  the 
asymptotic  analysis  is  to  define  the  simplest  problems  illus¬ 
trating  the  phenomena.  Both  qualitative  and  quantitative 
understanding  is  sought.  The  limit  processes  used  to  construct 
the  expansions  bring  out  the  similarity  parameters  of  the 
problem  and  provide  scaling  laws. 

In  problems  that  are  intrinsically  non-linear  the  represen¬ 
tative  simplified  problems  retain  that  feature.  Thus  numerical 
work  comes  into  the  picture.  The  asymptotic  analysis  isolates 
the  crucial  numerical  problem  and  can  reduce  the  amount  of 
computations  required  by  an  order  of  magnitude.  Typical  prob¬ 
lems  have  to  be  solved  numerically  in  infinite  or  semi-infinite 
domains.  But  asymptotic  matching  provides  the  necessary  far 
field  boundary  conditions  and  determines  the  constants.  Because 
of  the  asymptotic  analysis  the  numerical  problems  contain  the 
minimum  number  of  parameters.  Further  the  variables  and 
parameters  are  scaled  to  0(1),  simplifying  the  numerics. 

Limit  process  expansions  are  applicable  to  all  problems 
where  the  limits  of  the  solutions  exist.  If,  for  example, 
oscillatory  functions  appear  which  do  not  have  limits,  then 
multi-scale,  WKB,  or  averaging  methods  can  be  adopted.  Limit 
process  expansions  are  considered  here.  An  illustrative  form 
for  a  function  of  one  variable  x,  and  two  parameters  e, m  is  the 
following 


1 


for  a<x<b.  e  is  the  basic  parameter?  £<<1.  The  aj(e)  form  an 
asymptotic  sequence  so  that  as  e+0  ,  aj+1+0 .  x  -  x(x;e), 

aj 

V  ~  y(e,y)  are  definite  functions  chosen  for  the  purposes  of  the 
expansion.  p(e,y)  is  a  similarity  parameter.  The  choices  are 
not  arbitrary  but  are  connected  with  the  requirement  that  the 
limit  process  associated  with  (1)  be  distinguished  (cf  [l ] ) • 

The  successive  terms  in  (1)  are  computed  from  f(x;e,p)  by 
successive  application  of  the  limits  (e+0,  x,V  fixed).  Thus 


lim 

e>0 
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These  expansions  are  not  usually  valid  in  the  entire  region 
(a,b)  but  have  some  restricted  domain  of  validity,  in  which  the 
asymptotic  nature  of  (1)  remains.  When  the  validity  breaks  down 
(non-uniformity)  an  adjacent  expansion  is  sought,  which  can  be 
matched  to  the  previous  one  in  an  overlap  domain. 


In  the  following  sections  three  examples  are  presented  in 
which  asymptotic  analysis  leads  to  a  better  understanding.  In 
the  first  involving  water  waves  there  is  a  non-uniformity  at 
infinity  and  also  as  the  parameter  Froude  number  F+l.  The  reso¬ 
lution  of  the  non-uniformity  at  infinity  is  a  non-linear  far 
field  expansion.  The  resolution  of  that  as  F+l  seems  to  demand  a 
numerical  calculation  of  an  unsteady  flow.  The  second  example 
of  transonic  slender  body  theory  shows  a  boundary  value  problem 
which  is  singular  because  the  body  shrinks  to  a  line.  The  inner 
expansion  preserves  body  geometry  and  is  valid  for  small  radius. 
The  outer  expansion  preserves  non-linear  transonic  effects  and 
is  valid  from  some  region  near  the  body  to  infinity.  These 
expansions  do  not  really  match  so  that  an  intermediate 
expansion,  valid  for  radius  of  order  of  the  body  length  is 
necessary.  The  third  example  considers  thin  supersonic  wings 
whose  edges  are  swept  to  (or  close  to)  the  Mach  angle.  Although 
the  linearized  theory  gives  a  finite  answer  it  is  not  correct 
because  of  a  local  singularity  at  the  edge.  An  expansion  valid 
near  the  edge,  containing  non-linear  transonic  terms,  must  be 
constructed  and  matched  to  the  linear  solution  away  from  the 
edge.  An  analogy  is  thus  shown  between  the  steady  three- 
dimensional  flow  near  the  edge  and  two-dimensional  unsteady 
small-disturbance  transonic  flow. 

Another  connection  of  asymptotic  and  numerics  is  the 
asymptotic  analysis  of  finite  difference  schemes  as  the 
mesh-size  approaches  zero,  and  the  analysis  of  truncation  error. 
These  aspects  are  not  discussed  here. 


II.  TWO  DIMENSIONAL  FREE  SURFACE  FLOW  PAST  A  BUMF.  The 
exact  problem  for  an  inviscid  heavy  fluid  is  illustrated  in 


Fig.  1.  Continuity  and  irrotationality  lead  to  the  Laplace 
equation.  Flow  is  assumed  steady.  Boundary  conditions  of 
tangent  flow  are  applied  at  the  free-surface  and  at  the  bump  and 
the  Bernoulli  equation  is  used  to  express  p~0  on  the  free 
surface . 


T 
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FIG.  1.  EXACT  PROBLEM 


FKouPe  //*. 

F/t/tt  on  H 


X 


In  the  exact  formulation  velocities  are  scaled  by  the  free- 
stream  speed  U,  lengths  by  the  bump  length  L,  pressure  by  pU2 . 
The  exact  problem  is 

4>xx  +  4>yy  =  0  (II-l) 


free  (II-2) 

surface 


6  (x,ef(x)  =  ef  '  (x)  <j>  (x,  ef  (x) )  |  x  |  <  1 

JL  £ 

tanqency  (I I™ 3) 

=  0  |  x I >1 


4>y  ( x , ys  ( x ) )  =  ys  1  U)  4>xU«ys< *) ) 

'^{*x2V}+  y  =^  +  h 


The  parameters  are 

e  =  thickness  ratio  of  bump 


F 

h 


Froude  No. 
H/L  . 


=  u/VgH  =  flow  speed _ 

y  '  fastest  linear  wave  speed 
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For  a  small  bump  e<<l  a  limit  process  expansion  associated  with 
the  limit  e+0,  (x,y),(F2,h)  fixed  linearizes  the  problem.  There 
is  a  small  perturbation  of  uniform  flow  .  The  form  for  the 
potential  <j> ,  and  free  surface  shape  ys  is 

4>(x,y;e,F,h)  =  x  +  e<^(x,y;F,h)  +  e2<P2  ( x,y ;  F  ,h  )  +  ... 

ys'(x;e,F,h)  =  h  +  En(x;F,h)  +  E2n2(x;F,h)  +  ... 

The  linear  problem  that  results  was  solved  by  Kelvin  and  Lamb 

[2] 

^xx  +(f’yy  =  0  (II-5) 

F2hVxx(x,h)  +<Py(x,h)  =  0  (11-6} 

<py(x,0)  =  £'  (X)  .  (11-7) 

The  solution  has  the  features  shown  in  Fig.  2. 


FIG.  2.  LINEARIZED  FLOW  PAST.  BUMP 


For  F<1  the  flow  dips  down  over  the  bump  and  rises  with  a  wave 
train  behind.  There  is  a  finite  wave  drag.  For  F>1  no  linear- 
standing  waves  are  possible.  The  flow  rises  over  the  bump  and 
falls  behind.  The  drag  is  zero.  The  linear  expansion  has  a 
non-uniformity  as  F+1-; f or  F<I  the  wave  length  gets  longer  as  ■ 
F+l—  and  finally  the  solution  has  the  asymptotic  form  (if  is 
worked  out) 

n  ■*  -  2*  /  n0  +  -  JLx4  ( II-8 ) 

h2  ^  4h? 
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From  this  it  is  clear  that  there  is  a  non-uniformity  downstream 
when  Non-linear  wave  effects  become  important 

downstream. 

In  order  to  study  this  a  new  limit  is  considered  in  which 
F+l  and  the  observer  goes  farther  and  farther  downstream  as  e+0. 
That  is 


e  +  0,  ( x=eV3x  ,y ) ,  ( Kf  h  )  fixed  where  F2=l-Ke2/3  . 

K  is  a  similarity  parameter.  The  powers  of  e  are  chosen  for  a 
distinguished  limit.  x  changes  are  slow.  The  Laplace  equation 
is  approximated  by  <j>yy  =  0  and  in  effect,  iterated.  The  form  is 


<p  =  x  +  eV3ai(x)  +  e{a2(x)-y2a1' '  (x)/2)  +  e5/3{a3(x)-y2a2' 1  (x)/2 

4-  y4a^ 1 1 ' ' ( x )/24  }  +  ... 
ys  =  h  +  £2/3ci(x)  +  e4/3?  2(*)  +  (II-9) 

The  equation  for  the  free  surface  that  holds  downstream  is 


2 

d  Cl  +  f  2  +  3K  ?  «  o  (11-10) 

dx2  2h2  1  h2  1 


The  solution  is  chosen  so  that,  for  matching,  as  x+0+,  0+-3/h2 

(cf  II— 8 ) .  A  is  related  to  the  area  of  the  bump.  If  0(x=O)  is 
of  sufficiently  small  magnitude  there  is  matching  to  a  closed 
trajectory,  a  downstream  non-linear  (cnoidal)  wave  train 
periodic  in  x.  For  K=Kcr,  9  appears  at  0max  and  then  the 
solution  is  the  downstream  part  of  a  solitary  wave.  The 
ultimate  level  downstream  is  lower  than  that  upstream.  But  no 
solution  exists  for  K<Kcr  so  that  there  is  a  further 
non-uniformity  as  F  gets  closer  to  1.  The  supercritical 

solution  F> 1  also  blows  up  as  F+1+.  For  a  further  discussion  of 
this  problem  see  [3]. 
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Ill-  TRANSONIC  SLENDER  BODY  THEORY.  Transonic  slender 
body  theory  shows  a  non-unitormity  ot  a  different  nature. 
Consider  a  slender  body  as  in  Fig.  4  flying  close  to  the  speed 
of  sound.  The  theory  is  worked  out  in  the  framework  of  gas 
dynamics  (inviscid,  perfect  gas).  Shocks  are  weak 


«4  off) 


FIG.  4.  SLENDER  BODY 


so  that  the  flow  remains  isentropic  and  a  potential  $  exists. 
Lengths  are  scaled  by  the  body  length ,  velocities  by  the  free* 
stream  speed  U.  The  small  parameter  6  is  the  body  thickness 
ratio  and  measures  the  order  of  the  flow  deflection.  The  angle 
of  attack  a,  not  appearing  explicitly,  is  assumed  0(6).  The 
continuity  equation  becomes  the  compressible  potential  equation 


a2V2$ 


7$ 


(III-l) 


where 
2  =  1 


a 

U2 


Y-l 


M 


1  ~ 


U2 


y^ratio  of  specific  heats=f£ 

cv 


(III-2) 


(III-2)  is  the  compressible  Bernoulli  equation,  a=local  speed  of 
sound,  M^free -stream  Mach  number=U/a Equation  (III-l)  is 
non-linear^of  changing  type,  locally  elliptic  where  the  flow  is 
subsonic^  |q’|<a,  and  locally  hyperbolic  where  the  flow  is  super¬ 
sonic  |q|>a.  The  boundary  condition  of  tangent  flow  has  to  be 
satisfied  on  the  body  surface  B=0, 

on  Bfx,X,z_\  =0  ,  q. VB  =  0  •  (III-3) 

l  5  V 

The  approximations  are  concerned  with  6+0,  M„+l*  In  outer 
coordinates  the  body  shrinks  to  a  line.  An  inner  expansion  is 
needed  to  preserve  the  body  geometry  and  satisfy  (III-3).  Thus 
the  limit  5+0,  ( x,y*=Y. , z *=JL)  fixed  is  considered.  At  the  same 
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time  M  +1  such  that  K  =  1  Mgl  fixed,  as  considerations  of  the 

GO  O 

$2 

outer  limit  show  is  necessary.  The  inner  expansion  is  of  the 
form 

$  =  U-|x  +  <$2i0g  6  2S  (  x  )  h  63<£(  x ,  r*f  0  )  +64log  2  6^22 

+  64 log  6^21+ <$4(?2  +  -  *  (II I- 4) 

where  r*2=y*2+z*2  ,  e=tan~l^X_^. 


This  expansion  includes  so-called  switch  back  terms  introduced 
for  purposes  of  matching.  The  dominant  equation  of  the  inner 
expansion  is  the  Laplace  equation  in  each  cross-section  plane 

<?y*y*+<Pz*z*  =  0  (III-5) 

The  outer  expansion,  valid  to  infinity  has  to  preserve  the 
non-linear  changing  type  structure.  Since  disturbances  spread 
laterally  to  much  greater  distances  than  upstream,  the  observer 
has  to  run  to  infinity  laterally .  The  limit  process  has  6+0 

(x,r=6r,9;K)  fixed  where  r*/y2+z2.  The  expansion  is  of  the  form 
(anticipating  some  matching) 

-  U{  x+6^4,  (  x ,  r )  +  64  $2  (  x ,  r ,  6  )  +.  .  .  }  .  (III-6) 

The  resulting  dominant  equation  is  the  transonic  small- 
disturbance  equation 

(K-(y+1U  )A  +  +  IK  =0  (III-7) 

x  xx  rr  ^  r 

Matching  is  concerned  with  behavior  as  r*+®,  r+0.  We  know  that 
far  av/ay  the  solution  of  (III-5)  looks  like  a  source-line,  with 
S(x)  a  source  strength 

<P=  S  (  x  )  logr  *  +  g  ( x )  +  ...  (III-8) 

Formally  matching  for  the  first  terms  shows  that  indeed  <\>  is 
axisymmetric.  But  the  two  expansions  do  not  really  match  as  can 
be  inferred  from  the  fact  that  r  +  ®  in  the  outer  limit  and  r+0  in 
the  inner.  The  difficulty  is  manifest  in  the  higher  orders.  An 
intermediate  expansion  is  thus  needed  and  turns  out  to  be 
connected  with  the  limit  6+0,  (x,y,z?K)  fixed.  The  form  is 


$  =  U{x+62$(x,y,z;K)+63<t>i+64<|>2+.  .  .  }  -  (111*9) 


The  equations  are 
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<J>yy  +  <j>zz  “  0 


(III-10) 


+ 


=  0 
zz 


(III-ll) 


4>2  +  ^2  “  ^+1HJXX  "  K<t>  +  i-(  <t>  2+<j>  2  )  -  ( 1 1 1-12  ) 

^yy  Zzz  x  xx  xx  3x  Y  z 

The  intermediate  expansion  must  match  to  the  inner  expansion  as 
r=Vy2+z2“  +  o  r  (r**°°)  and  to  the  outer  as  r*»  (r+0).  Matching 
shows  that 


<jt  =  S  ( x )  logr  +  g(x)  .  (III-13) 

The  first  term  on  the  right  hand  side  of  (III-12)  is  dominant 
as  and  is  the  non-linear  term  necessary  to  match  to  the 

outer  expansion  as  r*0,  (III-7).  The  last  term  on  the  .right 
hand  side  of  (III-12)  dominates  as  r+0  and  is  the  non-linear 
term  necessary  to  match  to  the  higher  order  inner  terms. 


The  outcome  of  all  this  is  a  well  posed  boundary  value 
problem  for  ( 1 1 1  —  7 )  r<j>r*S(x)  r  +  0  where  S(x),  from  the  inner 
expansion,  is  related  to  the  rate  of  change  for  cross-section 
area  of  the  slender  body.  The  dominant  outer  flow  is  axisym- 
metric.  This  defines  g(x).  Further  a  deeper  understanding 
of  the  non-axisymmetric  flow  fields  is  gained. 


IV.  PSEUDO-TRANSONIC  FLOW.  In  the  same  general  framework 
as  the  preceeding  section,  pseudo-transonic  effects  occur  when 
the  leading  edge  of  a  thin  supersonic  (M„>1)  wing  is  swept  to 

the  Mach  angle  0.  =tan“l  (See  Fig.  5) 

M  U 


i* 


>  i 

MACH  C0Mf 


/=  /  F(*,  *) 


X 

PSEUDO-TRANSONIC  FLOW 
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a  now  measures  the  thickness  ratio  of  the  wing.  Linearized 
supersonic  theory  is  traditionally  used  to  calculate  the  flow 
about  such  wings.  The  physical  content  is  that  of  acoustics. 

The  limit  process  has  6+0  (x,y,z;M«)  fixed.  The  corresponding 
expansion  is  of  the  form 

$  =  U{x+6<j>(x,y,z;Mop)+.  ,  .  }  (IV-1) 

The  disturbance  potential  <j>  satisfies  the  classical  wave 
equation 


1  )  <£  xx  -  (  (j)yy  +  <PzZ  )  -  0  (IV-2) 

The  boundary  value  problem  in  y>0  can  be  solved  by  a 
distribution  of  supersonic  sources  over  the  wing  planform  in 
y =0 .  For  the  special  case  of  a  wedge  airfoil 

F(x,z)  =  (x-  z )3.  the  boundary  condition  of  tangent  flow 

is 

<{>y(x,0+,z)  =  1  { IV- 3  ) 

The  solution  is 

♦(x,y,z>  =-i  //  (1/2)dCd;  ,  82=M  2-l  (IV-4) 

*  real  /  (x-£  )  z-$z(z-t; )  z-Bzy2 


-  -  _1  / x2-$ 2 ( y2+z2 )  +  y  cos"1  3Y 
tt6  tt 

The  square  root  behavior  indicates  a  singularity  in  $x  at  the 
leading  edge  and  a  non-uniformity  of  linearized  theory.  The 
component  of  flow  normal  to  the  edge  is  exactly  sonic  so  that 
transonic  effects  could  be  expected.  Although  the  singularity 
is  integrable  the  finite  results  in  this  case  are  not  in  good 
agreement  with  experiment.  A  better  treatment  locally  comes 
from  the  following  limit 

6+0  (£  =  X“6Z  ,  ti=  Y  ,  c=z  f  M  )  fixed. 

62/3  6 i/3 

The  expansion  is  valid  in  a  local  region  near  the  leading  edge; 
The  form  turns  out  to  be 

$  =  U{x  +  64/3  (  n#  C  ;Mco)+- ,  .  }  ( IV-6  ) 


The  distinguished  equation  for  <P  that  results  is 

(t+1)M -  ^nn  +  2e<P5?  =  0  (IV-7) 
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This  equation  is  in  perfect  analogy  with  the  unsteady  version  of 
the  two-dimensional  transonic  small  disturbance  equation  ( 1 1 1— 7 ) 
with  K-0.  Here  £  +  x,  n+y,  ^Vt.  This  says  that  the  flow  can  be 
computed  as  if  there  were  unsteady  flow  at  each  spanwise  section 
starting  at  the  apex.  As  usual  (IV-7)  admits  shock  waves.  The 


solution 

solution 

dominant 

far  away  from  the  edge  must  match  to  the  linear 
( I V- 5 ) . If  we  write  (IV-5)  in  inner  coordinates  and  keep 
terms  we  see  that 

d>  +  -  i 

A2'3 2B?c-«2/Vn2  +  61/3i  cos*1 

«V3Bt) 

TT 

/6*/ 

Thus  for 

match ing 

cp  +  ■ 

-  1  ^2BEe-B2n2  +  n  cos-1  3n 

8"  *  TfeTc 

as  £ ,  n+0°  ( IV-8  ) 

This  provides  the  far  field  boundary  condition  for  (IV-7),  and 
calculations  can  be  performed.  This  particular  problem  is 
conical  and  can  be  simplified  further.  It  can  be  shown  that  the 
drag  including  pseudotransonic  effects  is  reduced. 

The  results  here  indicate  that  the  flow  near  a  high  aspect 
ratio  wing  swept  to  the  Mach  angle  should  also  be  approximated 
by  (IV-7).  This  is  the  case  for  the  wing  in  Fig.  6. 


FIG.  6.  SWEPT  HIGH  ASPECT  RATIO  WING 
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(IV-9) 


The  expansion  is  of  the  form 

*(x,y,z)  =  U{x+«2/3  ( z, n, ?;B+, . . . } 

where  5=x-ez,  n=sV3y,  5=6V3Z  and  the  similarity  parameter 

B=6V3b. 

The  results  of  this  section  are  easily  extended  to  cases 
where  the  edges  are  close  the  Mach  angle. 
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AN  EQUIVALENT  GAS  MODEL  FOR  DUSTY  GASES 


Donald  A.  Drew*  and  Fredrick  J.  zeigler** 

1.  INTRODUCTION 

Many  fluid  flow  problems  of  interest  concern  the  behavior  of  a  gas  which 
has  been  contaminated  with  small  particles  of  dust.  The  presence  of  the  dust 
can  cause  significant  changes  in  the  flow,  and  it  is  important  to  analyze  an 
explanation  for  this  phenomenon.  This  is  done  by  examining  a  model  in  which 
the  gas  and  dust  exchange  heat  and  momentum.  In  the  limit  of  low  volumetric 
concentrations  of  dust,  but  with  strong  coupling  between  the  phases,  the  model 
equations  are  closely  approximated  by  the  equations  for  an  adiabatic  ideal 
gas,  with  modified  values  of  the  density  and  ratio  of  specific  heats.  By  the 
use  of  similarity  transformations  of  these  equations,  it  is  possible  to  relate 
solutions  of  flow  problems  for  a  gas  with  dust  to  solutions  of.  corresponding 
problems  for  a  clear  gas,  thus  giving  an  explicit  way  of  calculating  the 
effect  of  the  dust  on  the  flow.  Because  of  their  simple  form,  the  equations 
of  transonic  flow  are  used  to  provide  an  example  of  this  procedure.  It  is 
found  that  the  transonic  flow  around  a  thin  airfoil  for  a  gas  with  dust  is 
equivalent  to  the  flow  around  an  airfoil  with  modified  thickness,  at  a 
different  free-stream  Mach  number. 


♦Department  of  Mathematical  Sciences,  Rensselaer  Polytechnic  Institute, 
Troy,  NY  12181 

♦♦General  Dynamics  Corporation,  Fort  Worth  Division,  P.  0.  Box  748,  Mail 
Zone  2882,  Ft.  Worth,  TX  76101 


Sponsored  by  the  United  States  Army  under  Contract  No.  DAAG29-80-C-0041 . 
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2.  EQUATIONS  OF  TWO-PHASE  FLOW 


The  equations  governing  the  flow  of  particles  and  fluid  are 


3ap 

•'-T72  +  v*aP  q  =  0  / 

3t  P  P 


8(1  -  a)P  ^ 

- — -  +  V*(1  -  a)pfqf  =  0  , 


<2. la) 


(2.1b) 


up 


3ci  ^  ^  ^ 

(3? +  VvV  =  "a7p  +  “V%  '  V  ' 


3q 


(i  “  «)pf(g~  +  qf*vqf)  *  -<“*  -  <*>vp  +  <*kM<qp  ”  qf> 

8(en+  jj)  .  - 


ap  [— ^ 


St 


+  Vv(ep  +  2 


(2. 2a) 


(2.2b) 


=  -  V*apq^  -  pVot* +  abM^f  ~  +  aHM*Tf  *  Tp*  '  (2.3a) 


r9(ef  +  2  H  ♦  1  2  1 

(i  -  «>pf [ — Hr — +  vv<ef +  2  qf - 


-V*  (1  -  cOpq^  -  pVa*q^  -  Ctb^q^  -  q^)*^  +  aHM*Tp  “  Tf  >  '  (2.3b) 


where  2.1,  2.2,  and  2.3  are  equations  of  conservation  of  mass,  momentum,  and 

■¥ 

energy  for  the  particle  and  fluid  phases.  Here  q  represents  the  velocity 

(p  for  particles,  f  for  fluid),  ct  is  the  dust  volumetric  density,  p 

denotes  the  phasic  density,  p  is  the  pressure,  and  e  is  the  Internal 

energy  per  unit  mass. 

The  dust  is  assumed  to  be  incompressible 

p  =  const  ?  (2.4) 

P 

the  ideal  gas  law 
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holds  for  the  fluid.  We  assume 


and. 


+  p/P- 


(2,6) 


The  terms  on  the  right  of  (2.2)  and  (2.3)  involving  t>M  and  reflect 

momentum  and  energy  transfer  between  the  particles  and  gas.  For  b^  we 
assume  Stokes  drag 


9  1 

bM  2  PfVf  2  * 
a 


(2.7) 


The  heat  transfer  coefficient  is  assumed  to  be 

«M  “  3  NU  *  bM  * 

where  Nu  is  the  Nusselt  number  and  Pr  is  the  Prandtl  number. 


(2.8) 


The  dusty  gas  limit  of  the  previous  two-phase  flow  equations  corresponds 

to  the  limit  a  <<  1,  p  <<  p  and  ap  ~  .  In  order  to  exhibit  this 

f  p  p  f 

limit  explicitly  we  first  scale  the  variables  and  put  the  equations  into 
nondimensional  form. 

Thus,  we  assume  the  problem  contains  a  typical  length  scale  L,  velocity 
scale  U,  gas  density  scale  F,  temperature  scale  TQ,  and  volumetric 
concentration  scale  A.  If  we  denote  a  dimensionless  variable  by  a  caret  (A), 
we  have 


+  V-(^  )  =  0 

St  P 


(3.1a) 


ap 

at 


f  ~  a 

-  +  v-pfqf  =  0 


(3. 1b) 


9qp  -*  *+  1  + 

+  q  *Vq_  =  —  " 

at 


q  *  Vq  =  ( q  -  q  ) 

p  ef  f  p 


3q. 


at 

[3<Ep  +  v^p + 1  ?>]  -  h  <i  -  v*s +  h 

at 


(T,  - 


T  ) 
P 


*  12 

A  r9<Sf  +  2  <3f)  +  *  *  1  A2vl 

P,[ - - -  +  <V7(ef  +  2 


St 


(3.2a) 


(3.2b) 


(3.3a) 


AA+  i  A  +■  +  +  hAA 

-  - V •  pq  -  —  a(q  „  -  q  ) * q  +  —  a(T  -  T  )  . 

er  “ -F  ^-1-1  f 


p  ^p  e 


(3.3b) 


The  equations  of  state  take  the  form 

A  A  A 

P  “  rpfTf  , 


£  =  cT 

P  P 


ef  =  cTf  +  p/pf 


(3.4) 

(3.5) 

(3.6) 
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The  dimensionless  constants  are  defined  by 


« =  (g-KjK-i-) 

f  9pf 

h  -  HHT0/bMu2 


r  =  RTqAJ 


c  =  c*P*T  /U2 
v  0 


;  =  c<fw 

v  0 


(3.7a) 

(3.7b) 

(3,3c) 

(3.3d) 

(3. 3e) 
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4.  GENERALIZED  GAS  MODEL 


we  see  that  for  sufficiently  small  particles ,  e  will  be  small.  The 

rest  of  our  discussion  will  be  limited  to  the  regime  e  <<  1. 

For  this  case,  inspection  of  equation  3.2  shows  that  q  «  q  (we  now 

f  p 

drop  the  carets  for  convenience)  unless  the  accelerations  are  large.  This 

suggest  a  model  which  we  term  the  generalized  gas  model,  which  has  the 
->■  -► 

property  that  q^  =  +  0(e),  except  in  places  where  the  flow  fields  change 

rapidly.  We  shall  call  these  regions  of  rapid  change  generalized  shocks.  As 

we  shall  see,  this  model  is  analogous  to  a  gas  with  changed  properties. 

+  + 

Consider  the  flow  fields  u,  q^,  ,  etc.  as  functions  of 


x,  t,  and  e.  In  the  generalized  gas  model,  we  consider  a  limit  of  the 

4- 

dusty  gas  equations  in  which  t  +  0,  with  x,  t  held  fixed.  Away  from 
generalized  shocks,  we  therefore  consider  an  expansion 

a(x,t?e)  ”  ot^°\x,t)  +  ea^(x,t)  +  ...  (4.2) 

with  similar  expressions  for  the  other  flow  quantities. 

Substituting  these  expressions  into  the  dusty  gas  equations  and  equating 
terms  of  equal  order  to  zero  gives  the  following: 


+  <0)  +<0)  _  + 
%  =  s  =  q 


(4.3) 

(4.4) 

(4.5) 

(4.6) 

(4.7) 
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(4.8) 


■>"'(!»♦  -  V' *  .‘"’(Si''  -i:”. 


1  2 

3(CT  +  -q_)  +  .>V(CT+  1  2)]  _  (5(D  _  |(,))<$+  h(T<1) 

^  £  p  f 


at 


(0)  *  1  2v 

(0)r3Uf  »  * 

Pf  L  9t 


♦*v<.‘0,+4,2>]  = 


T  ) 
P 


Um  (0)* 
-V*p  q 


<o> +<n 

Ct  <qf 


+  <1>,  +  x 

)-q  +  ha  (T 

P 


s 


JD, 

Tf  > 


(0) 


<ok 

rpf  T 


JO)  -  *  .  (0)/ft(0) 
ef  =  ct  +  p  /p 


This  model  allows  us  to  derive  the  equations  needed  at  the  lowest 

(0) 

rimes  *<*./>  i 

k 

■f  ”  ~'v3t 

A  similar  combination  of  (4.9)  and  (4.10)  gives 


order.  Adding  a  times  (4.7)  to  (4*8)  yields 


(P, 


(  0  )  r  3 cT  -|  ( 0  )  r  3 CT  -+■  i 

P,  LT—  +  q"  VcT  ]  +  a  9 1[~  +  q-VcTj  + 


'f  L3t  '  *  ¥~*J  '  "  ^L3t 

1  2 
f 


a/  (0),  (OK 

.  (0)r3<P  /Pf  }  +  „  .  (0)  .  (0)  1  „  (0)  + 

Pf  ^ — - -  +  q-V(p  /pf  )J  =  -v*p  q  . 


Additionally,  (4.5)  and  (4.6)  may  be  combined  as 


JO)  ,  ^<0  )  _.  . 

d  (p  +  a  f )  dp 

_L_ - -+  v.(pj  +  a  }f)$  =  0  =  5^+  vP-q 


(4.9) 


(4.10) 


(4.11) 


(4.12) 


(4.13) 


(4.14) 


(4.15) 
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where  p  =  +  a^f  is  the  mixture  density.  Using  (4.5)  and  (4.6)  we 

m  f 


obtain 


(0) 


k  +  ^«n(a(0>/P<0))  -  0 


(4.16) 


Therefore  is  constant  for  a  fluid  particle.  We  shall  assume  that 

is  constant  over  the  entire  flow  domain.  If  we  subtract  the 
kinetic  energy  equation  from  (4.14),  and  use  (4.6)  and  (4.11),  we  have 


(pf^«(0)fc  +  p|0)r)[^S.VT]=rT[ 


3p 


(0) 


3t 


+  q*vPf0)] 


(4.17) 


Therefore 


where 


T  -  Const* (p^°  ^ 1 


(4.18) 


T  =  T 


;  +  a<°>fc/pj°> 


T  + 


q^f 

(0) 


c  +  r 


c  +  r  +  a<°>fc/p<°> 


1  + 


a(0>f 

JO) 


c  +  r 


(4.19) 


Dropping  the  (0)  superscript  for  the  pressure,  the  generalized  gas 
model  is 

TT  +  7'pm*  -  0  (4’20) 


Pm^ft  +  =  _Vp  ' 


(4,21) 


P/P0  =  (VV  ' 


(4.22) 
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where  PQ  and  pQ  are  constants.  Thus,  a  dusty  gas  with  small  dust 
particles  behaves  like  a  gas  with  a  modified  y.  We  emphasize  that  this 
derivation  assumes  that  we  are  not  near  a  shock. 

We  note  that  the  speed  of  sound  for  a  generalized  gas  is  given  by 


=  dp_  ^  I£  ^ 


TP, 


(0) 


d p_ 


m 


m 


TP 


r-BM 
V  (0)> 


TP, 


(0) 


m 


TP 


m 


where  a  is  the  speed  ^ sound  in  the  clear  gas. 


(0)  i 

P  ”  P_  +  CL  f  / 
m  t 


TP, 


TP 


m 


_  I 
T 


1  + 


q(0)f 

.(0) 


Moreover,  since 


21 


5.  SMALL  DISTRUBANCE  THEORY 

The  dusty  gas  may  be  regarded  as  an  equivalent  gas  with  a  modified  value 
of  y.  Assuming  that  we  have  potential  flow,  it  is  therefore  possible  to 
formulate  a  small-disturbance  theory  for  transonic  thin  airfoils  in  dusty 
gases,  analogous  to  that  in  [2].  The  transonic  similarity  parameter  for  this 
case  is  a  function  of  the  new  value  of  Y»  It  will  be  shown  how  this  modified 
similarity  parameter  is  related  to  the  usual  one  for  the  case  of  no  dust,  so 
that  a  method  of  estimating  the  effect  of  the  dust  on  the  usual  small- 
disturbance  theory  may  be  achieved, 

"We  formulate  the  boundary  value  problem  for  a  thin  airfoil,  in  a  dusty 

gas  and  travelling  in  the  transonic  range.  The  free-stream  velocity  is  U 

(in  the  x  direction),  where  the  coordinates  x,  y  have  been  normalized  with 

respect  to  the  airfoil  chord.  The  airfoil  is  given  by  y  *  6f  0(x),  where 

the  function  F  satisfies  max  |F  (x)  -  F  (x)  |  -  1.  The  free  stream  Mach 
*  *  xe  [0,1]  u 

number  is  M  =  U/a  . 

CO  00 

Under  the  condition  of  small  disturbances,  we  may  derive  an  approximate 

equation  by  a  limit  process  expansion  for  the  velocity  potential  based  on 

*2 

1/3  -  1  "  Mw 

the  limit  process  6  +  0,  with  x,y  =  6  y,  and  K  =  — 2/2“  f^xed  as 
A  <5 

M  +  1 .  It  has  the  form 


$(x,y;6)  =  U{x  +  5  <j>(x,y)  +  •••}  • 


(5.1) 


Then  <i>  satisfies 


(K  -  (Y  +  1  )<f>  )<$>  +  <J>~~  -  0 

T x  Txx  yy 


(5.2) 


and  the  boundary  conditions 


f>~(x,(T)  =  F'  (x)  0  <  x  <  1 

y  u,x 

d>  ,<J)~  +  0  as  x  +  . 

x  y 


(5.3) 


(5.4) 


22 


In  this  context, the  Kutta  condition  may  be  written  as 


[Vte  -  0  ' 


(5.5) 


where  TE  means  trailing  edge. 


Additionally,  shock  jump  conditions  must  be  imposed  in  order  to  have  a 
complete  problem  for 

The  form  of  (5.2)  shows  that  for  two  flows  at  different  transonic  Mach 

A 

numbers,  but  with  the  same  values  of  K,  these  two  flows  will  be 

geometrically  similar  (the  difference  will  be  that  the  "size*  of  the 

2/3 

disturbance  will  be  determined  by  different  factors  of  0  i.  An  analogous 
similarity  law  holds  for  the  gas  alone,  governed  by  the  similarity  parameter 
K,  Both  of  these  similarity  laws  relate  families  of  flows,  for  different 
values  of  the  displacement  thickness  at  a  corresponding  Mach  number. 

We  wish  to  see  how  the  change  in  the  ratio  of  the  specific  heats  for  the 

A 

dusty  gas,  y,  affects  the  flow.  This  will  be  done  by  finding  a 
correspondence  between  the  similarity  parameters  in  the  two  cases. 

We  shall  develop  the  correspondence  by  transforming  the  boundary  value 
problem  (5. 2)- (5. 5)  for  the  dusty  gas  into  a  problem  for  the  gas  without 


dust.  Define 


-2* 

K  =  <D  K 

~  I1/3 

y  =  (i)y  =6  y 


(5.6a) 


(5.6b) 


(5.6c) 


where  to  =  and  6  =  o>  6.  Then,  under  (5.6)  the  entire  problem 

for  <J>(x,y?K)  transforms  to  the  corresponding  problem  for  <J>(x,y?K),  which 
is  the  case  of  the  gas  without  dust.  The  similarity  parameter  K  for  the  new 


problem  is  of  the  form 
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,  3  x  .2/3  2  x2/3 

(w  5)  w  6 


from  which  we  conclude  M  -  M  . 

00  00 

Thus,  we  see  from  (5.6c)  that  the  small  disturbance  problem  for  a  dusty 

gas  is  equivalent  to  one  for  a  clear  gas,  with  a  modified  amplitude  of 

disturbances,  a  changed  wing  thickness  ratio  and  similarity  parameter. 

(0)  ,  (0) 

The  transformation  parameter  to,  as  a  function  of  a  f/pf  '  shown 

A 

in  Figure  5.1  for  c/(c  +  r)  =  1.0,  which  is  a  representative  value  for  many 
materials . 


u  ”  1.29  - 
m 


a(0,f/P<0) 


Figure  5.1.  to  vs.  ^f/pf  ^ 


Thus,  for  example,  a  dusty  flow  with  a  mass  loading  of  2.0  corresponds  to 
a  value  of  u>  of  1.21,  which  means  that  if  the  airfoil  has  thickness  6,  the 

flow  is  equivalent  to  that  of  a  clear  gas  around  an  airfoil  of  the  same  shape, 

-  3  - 

but  of  thickness  5  =  oj  6  =  1.776.  The  equivalent  similarity  parameter  K  is 

-2  A 

K  =  (jJ  K  —  0.68K. 
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ABSTRACT .  We  study  the  flow  of  two  immiscible  fluids  of  different 
viscosities  and  equal  density  through  a  pipe  under  a  pressure  gradient.  This 
problem  has  a  continuum  of  solutions  corresponding  to  arbitrarily  prescribed 
interface  shapes.  The  question  therefore  arises,  which  of  these  solutions  are 
stable  and  thus  observable.  Experiments  have  shown  a  tendency  for  the  thinner 
fluid  to  encapsulate  the  thicker  one.  This  has  been  "explained"  by  the 
viscous  dissipation  principle,  which  postulates  that  the  amount  of  viscous 
dissipation  is  minimized  for  a  given  flow  rate.  For  a  circular  pips,  this 
predicts  a  concentric  configuration  with  the  more  viscous  fluid  located  at  the 
core.  A  linear  stability  analysis,  which  is  carried  out  numerically,  shows 
that  while  this  configuration  is  stable  when  the  mere  viscous  fluid  occupies 
most  of  the  pipe,  it  is  not  stable  when  there  is  more  of  the  thin  fluid. 
Therefore  the  dissipation  principle  does  not  always  hold,  and  the  volume  ratio 
is  a  crucial  factor. 

I.  INTRODUCTION .  The  flow  we  consider  is  a  cylindrical  pipe  of  infinite 
length  in  which  there  are  two  fluids.  The  fluids  are  immiscible  and  have  the 
same  density  but  different  viscosities.  The  flow  is  steady,  purely  axial  and 
driven  by  a  prescribed  pressure  gradient. 

The  equations  governing  the  flow  are  the  steady  Navier-Stokes  equations 
with  the  velocity  and  the  pressure  gradient  in  the  axial  direction,  and 
incompressibility.  The  boundary  conditions  are:  no  slip  at  the  pipe  wall, 
and  at  the  interface  of  the  two  fluids,  which  one  of  the  unknowns,  the  normal 
and  shear  stresses  and  the  velocity  are  to  be  continuous.  We  specify  the 
ratio  of  the  cross-sectional  area  occupied  by  each  fluid  and  ask  the 
question:  what  shape  will  the  interface  be? 

Theoretically,  it  is  known  that  if  there  is  no  surface  tension,  every 
interface  position  is  allowed  by  the  equations.  If  there  is  surface  tension, 
then  the  interface  has  to  be  circles  or  circular  arcs  terminating  at  the  pipe 
wall*  The  number  of  possible  steady  solutions  is  still  infinite.  For 
example,  if  you  specify  that  fluid  1  occupies  1/3  of  the  area  and  fluid 
2  occupies  2/3  of  the  area,  2  possible  arrangements  are  shown  in  Figure  1. 
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Figure  1 


Such  nonuniqueness  appears  in  the  theory  of  steady  2-fluid  flows  for  all 
kinds  of  flow  regimes  [1].  On  the  other  hand,  experiments  with  the  pipe  flow 
indicate  that  whatever  the  initial  configuration,  the  low  viscosity  liquid 
will  eventually  encapsulate  the  thicker  fluid.  The  encapsulation  property  has 
been  observed  for  both  high  and  low  Reynolds  number  flows,  ranging  from  oil 
and  water  to  molten  polymers  [2,  3,  4,  5,  6,  1,  8,  9] - 

What  has  to  be  done  is  to  reconcile  the  existence  of  a  continuum  of 
solutions  with  the  experimentally  observed  unique  configuration.  Up  to  now, 
explanations  have  been  based  on  a  variational  method,  called  the  ’’viscous 
dissipation  principle"  which  says  that  the  flow  chooses  an  interface  which  in 
some  sense  minimizes  viscous  dissipation  for  a  given  flow  rate,  or 
equivalently,  maximizes  the  volume  flux  for  a  given  pressure  gradient  [2,  5, 

8,  10].  This  is  based  on  the  idea  that  there  is  work  to  be  done,  but  it  is 
harder  to  make  the  thick  fluid  do  the  work,  so  the  thin  fluid  does  it  by 
migrating  to  regions  of  high  shear.  The  thin  fluid  is  easy  to  push  around. 

Michael  Renardy  has  shown  that  for  a  pipe  with  arbitrary  cross-section, 
the  minimizer  of  viscous  dissipation  exists  if  there  is  an  a  priori  estimate 
for  the  length  of  the  interface  curve.  This  is  a  quantity  which  we  do  not 
know  how  to  obtain  [11].  Not  much  more  is  known  about  the  interface  for  flows 
in  pipes  of  arbitrary  cross-section.  However,  for  a  circular  pipe,  the 
analysis  is  much  simpler  because  of  the  symmetry  and  the  minimizer  turns  out 
to  be  the  concentric  configuration  with  the  thick  fluid  at  the  core.  This 
appears  to  agree  with  experiments.  Our  question  is:  how  valid  is  the  viscous 
dissipation  principle?  One  way  to  find  out  is  to  do  a  stability  analysis  for 
the  circular  pipe  to  see  if  the  configuration  preferred  by  the  viscous 
dissipation  principle  turns  out  to  be  stable, 

II.  NUMERICAL  CALCULATIONS.  We  did  a  linear  stability  analysis  for  the 
circular  pipe  where  the  basic  flow  is  the  Poiseuille  flow  with  a  concentric 
interface  (Figure  2).  Fluid  1  is  at  the  core,  fluid  2  encapsulates  fluid  1, 

,  .  _ ,  .  „  , ,  ,  .  .  i  ( -act+ctz+nG ) 

We  superimpose  an  infinitesimal  disturbance  (u,v,w,p)e  •  We  use 

a  Chebyschev  polynomial  expansion  in  the  radial  direction  [12],  The  problem 

is  then  an  eigenvalua  problem  for  c,  given  all  the  other  parameters.  If  the 

sign  of  the  imaginary  part  of  c  is  positive,  the  flow  is  unstable  to  small 

disturbances. 

The  particular  case  of  the  long-wave  limit,  a  x  Reynolds  number  +  0, 
and  the  thinner  fluid  at  the  core  was  studies  by  Hickox  [13]  and  was  shown  to 
be  unstable.  This  supports  the  viscous  dissipation  principle,  but  Hickox  did 
not  look  at  the  case  where  the  thicker  fluid  is  at  the  core  to  see  if  that 
would  be  stable. 
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Figure  2 


III*  RESULTS.  The  eigenvalue  which  determines  instability  is  an  interfacial 
one  in  the  sense  that  t  is  neutrally  stable  when  the  two  viscosities  are 
equal.  This  situation  is  not  identical  to  the  one-fluid  flow  because  of  the 
extra  conditions  at  the  interface.  Yih  [14]  found  similar  results  when  he 
looked  at  plane  Couette  flow  with  a  flat  interface  with  the  long-wave 
approximation. 

Our  range  of  parameters  is  the  following:  viscosity  ratio  ^rom 

0.2  to  8,  dimensionless  wavelength  of  axial  disturbance  ctR^  from  0.1  to 

10,  reference  Reynolds  number  R2V^R*|^V2  ^rom  ®  to  1000,  where  V(R,j) 

is  the  basic  velocity  at  the  interface  and  is  the  kinematic  viscosity, 

density  is  taken  to  be  1* 


First,  we  found  that  the  configuration  with  the  thin  fluid  at  the  core  is 
unstable.  This  extends  Hickox's  long-wave  results  and  agrees  with  the  viscous 
dissipation  principle.  Secondly,  when  the  thick  fluid  is  at  the  core, 
stability  depends  on  the  radius  ratio  R^/R2*  This  is  what  we  found  to  be 


most  interesting  because  it  shows  that  the  viscous  dissipation  principle  is 
not  always  true.  This  dependence  of  the  stability  on  the  radius  ratio  is 
qualitatively  similar  to  Yih*s  [14]  results  where  stability  depends  on  the 
depth  ratio  of  the  2  fluids.  Figure  3  shows  an  example  of  what  we  found  at 
Reynolds  number  100,  aR^  -  1. 


STABLE 


UNSTABLE 


Figure  3 


>  1 
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Figure  4  Is  a  graph  of  the  imaginary  part  of  c  versus  viscosity  ratio 
for  Re  =  100,  GR^  “  1,  Ry/Kj  "  Numbers  next  to  the  curves  denote 

azimuthal  mode  numbers.  The  dark  points  on  the  curves  show  our  computed 
values  and  the  dashed  lines  are  interpolants .  At  this  radius  ratio,  there  is 
a  slight  instability  due  to  the  higher  azimuthal  modes.  (Mode  5  becomes 

Ri 

positive  in  the  inset.)  For  —  >  0.7,  the  curves  sink  below  the  Im(c)  =  0 

Ri  2  ~ 

line  for  * —  >  1.  For  ~  <  0.7,  the  curves  rise  above  the  line,  yielding 
Vl2  K2  ~ 
the  results  in  Figure  3. 


Thirdly,  when  the  viscosity  ratio  is  large,  the  response  changes  only 

R1  V1 
gradually.  Figure  5  is  a  graph  at  —  “  0.9,  Re  =  100,  gR  -  1.  Whether  — 

2  2  P2 

is  6  or  7,  it  does  not  make  much  difference  to  the  imaginary  part  of  c. 

This  behaviour  has  been  mentioned  in  some  experiments  [2]. 


Figure  5 

Re  =  100,  GR^  =*  1,  72  0,9 
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Fourthly,  for  short  disturbance  wavelengths,  stability  is  lost.  We  did 
not  include  surface  tension  in  our  computations  and  that  would  dampen  some  of 
these  instabilities.  Figure  6  is  a  graph  of  im(c)  versus  viscosity  ratio 

Ri 

at  Re  -  100,  —  -  0.8,  aR  =  10.  When  aR.^  =  1,  the  region  of  stability  is 

p,  2 

—  >  1  but  for  ctR*  =  10,  the  modes  are  mostly  unstable.  This  agrees  with 

V2 

the  results  of  Hooper  and  Boyd  [15]  who  consider  the  linear  stability  of  an 
unbounded  Couette  flow.  The  2  fluids  occupy  each  half-plane.  Their  analysis 
is  relevant  locally  at  any  interface  with  a  viscosity  jump  and  predicts 
instability  for  short-wave  disturbances.  This  is  in  contrast  with  one-fluid 
flows  where  viscosity  acts  to  dampen  short  waves. 


Fifthly,  as  the  Reynolds  number  increases,  stability  is  lost.  Figure  7 


is  a  graph  of  Re  =  -1000,  r~  =  0.8, 
pi  2 

stability  is  —  >  1  but  when  Re 
**2 

region  of  stability  at  Re  -  1000 


aR^  =1.  When  Re  -  100,  the  region  of 

=  1000  the  0^  mode  is  unstable.  The 

for  Figure  7  is  for  — £1.8. 

V2  ~ 


IV.  CONCLUSION.  Our  conclusion  is  that  the  viscous  dissipation  principle  is 
not  always  true,  but  that  does  not  mean  we  are  saying  it  is  never  true.  From 
our  results  (Figure  3),  there  seems  to  be  some  truth  to  the  basic  idea  that 
the  thin  fluid  tends  to  lubricate  the  wall.  However,  if  there  is  a  large 
enough  amount  of  the  thin  fluid,  other  mechanisms  must  be  at  work.  Also,  the 
Situation  we  have  dealt  with  has  a  rotational  symmetry  and  symmetric  solutions 
to  symmetric  problems  always  have  a  special  status  so  that  it  is  quite  natural 
for  the  concentric  configuration  to  be  preferred  over  others  in  this  case, 
whatever  the  mechanisms  may  be. 
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Figure  7 

Imaginary  part  of  c  versus  viscosity  ratio 
Re  «  1000,  gr2  =  1,  R1/R2  =  0*8,  velocity  scale  -  1 
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ABSTRACT.  A  numerical  simulation  and  study  are  presented  for 

characterization  of  the  interaction  flow  of  a  water  jet  with  a  chemical 
contaminant  droplet  on  a  plane  wall,  which  occurs  in  chemical  decontamination 
processes.  The  model  for  this  analysis  is  a  two-dimensional,  two-fluid  flow 
governed  by  the  Navier-Stokes  equations.  Emphases  of  the  study  are  on  the 
evolution  of  the  contaminant  droplet  and  the  effects  of  variation  of  relevant 
flow  parameters  on  the  displacement  of  the  contaminant.  Computed  results 
show  that  a  jet  impingement  at  an  angle  of  incidence  in  the  range  of  45°  - 
60°  can  achieve  the  highest  cleaning  speed  of  the  contaminant.  The  results 
also  show  that  an  increase  in  either  the  jet  velocity  or  its  cross-sectional 
area  can  greatly  improve  the  cleaning  speed.  However,  for  a  given  jet  flow 
rate,  it  is  more  advantageous  to  increase  the  jet  velocity  rather  than  the 
cross-sectional  area  in  order  to  increase  the  efficiency  of  the  use  of  jet 
fluid. 


I.  INTRODUCTION.  Application  of  jet  spray  for  removal  of  chemical 
contaminants  of  solid  walls  has  long  been  recognized  to  be  effective.  The 
procedure  is  to  use  the  great  force  produced  by  the  turning  of  the  jet  stream 
to  displace  the  contaminant  which  is  in  the  form  of  droplets  along  the  walls. 

A  high-performance  jet  system  for  this  use  should  possess  the  following 
basic  feasures:  high  cleaning  speed  and  efficient  use  of  jet  fluid.  These 
two  features  are  particularly  important  when  the  system  is  operated  in  the 
field  where  it  is  often  required  to  decontaminate  an  area  in  the  shortest 
period  of  time  and  with  the  least  consumption  of  jet  fluid*  In  designing  such 
a  system,  knowledge  of  fundamental  characteristics  of  the  flow  is  vital,  such 
as  the  evolution  of  the  contaminant  droplet  and  the  effects  of  variation  of 
relevant  flow  parameters  on  the  jet  performance. 

The  contaminant  droplet  has  an  average  size  of  3  mm  in  diameter  and  0.6 
ram  in  height.  Its  density  is  approximately  the  same  as  that  of  plain  water, 
however.  Its  viscosity  may  vary  widely  from  10  to  1000  times  the  viscosity  of 
plain  water. 

The  interaction  flow  between  the  jet  and  the  contaminant  droplet  consists 
of  two  fluids,  the  jet  fluid  (water)  and  the  contaminant,  or  three  fluids  if 
the  ambient  is  treated  as  the  third  one.  The  two  prime  fluids  are  separated 
by  interfaces  and  have  free  surfaces  with  the  ambient.  The  flow  is  three- 
dimensional  in  nature  and  is  highly  transient.  Most  of  the  investigations 
conducted  in  the  area  of  jet  impingement  in  the  past  are  relevant  to  the  VT0L 
program  (vertical  takeoff  and  landing  aircraft)  or  rocket  exhaust  flows,  and 
are  concerned  with  impingements  on  a  solid  surface  [  1 ,2 ,3,4 ,5 ,6] .  For 
impingements  on  a  liquid  surface,  Hund  [7]  and  Vanden-Broeck  [8]  considered 
steady  and  two-dimensional  cases.  They  used  simplified  theories  to 
characterize  the  wave-like  hydrodynamic  instability  occurlng  at  the  interface 
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of  the  two  fluids,  but  gave  no  predictions  on  the  velocity  and  pressure 
distributions  in  the  flow  field. 


In  the  present  investigation,  we  have  simplified  the  analysis  by  treating 
the  jet-contaminant  flow  as  a  two-dimensional  problem.  We  have  developed  a 
two-fluid  flow  model  suitable  for  characterization  of  the  flow  and  adoption  of 
the  existing  computer  code  SOLA-VOF  [9]  for  numerical  solutions.  The  flow  is 
governed  by  the  unsteady  Navier-Stokes  equations.  Presented  are  the  evolution 
of  the  contaminant  droplet  and  effects  of  variation  of  relevant  flow 
parameters  on  the  jet  performance  in  displacing  the  droplet. 

II,  FLOW  MODEL.  Figure  1  depicts  a  pre-impingement  flow  configuration 
which  practically  occurs  in  the  decontamination  process.  A  water  jet  is 
directed  at  a  contaminant  droplet  which  is  initially  covered  by  a  thin  water 
layer  on  a  plane  wall.  To  characterize  the  flow  developing  from  this 
configuration  following  the  impingement  we  have  developed  a  two-dimensional, 
two-fluid  viscous  flow  model  in  Figure  2. 


The  flow  region  in  the  model  is  essentially  the  region  enclosed  by  the 
dashed  line  indicated  in  Figure  1,  covering  the  major  part  of  the  flow 
field.  It  resembles  a  channel  flow  containing  two  fluids  (water  and 
contaminant)  separated  by  an  Interface.  Both  fluids  are  assumed  to  be 
newtonian  and  the  surface  tension  effect  along  the  interface  is  neglected 
because  of  its  small  magnitude.  The  upper  boundary  of  the  channel  coincides 
with  the  upper  free  surface  of  the  water  layer  so  as  to  eliminate 
consideration  of  the  free  surface  interface  with  the  ambient.  An  outflow 
condition  is  specified  at  this  boundary  and  at  the  ends  of  the  channel, 
allowing  the  fluids  to  flow  out  of  the  region.  The  contaminant  which 
initially  occupies  the  shaded  rectangular  region  is  assumed  to  wet  perfectly 
the  bottom  wall  of  the  channel.  To  account  for  viscous  effects,  a  no-slip 
condition  is  used  for  the  bottom  wall.  Finally,  a  steady  uniform  jet  velocity 
at  an  angle  of  0  is  specified  along  a  segment  of  the  upper  boundary  as  shown 
in  the  figure.  The  lower  left  side  of  the  Figure  2  shows  part  of  the 
computational  domain  of  the  flow  field.  The  domain  is  discreted  into  a  100  x 
12  mesh  with  finer  zoning  near  the  lower  wall  to  ensure  better  accuracy  of 
computation  in  this  thin  layer. 


III.  FLOW  EQUATIONS  AND  METHOD  OF  SOLUTION.  The  governing  equations  for 
the  model  flow  are: 


continuity 


momentum 


(1) 

(2) 

(3) 


where  t  is  time  variable,  u  and  v  are  the  x-component  (along  the  channel)  and 
the  y-component  (normal  to  the  channel)  of  the  flow  velocity,  respectively. 
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The  density  p,  the  sound  speed  c,  and  the  viscosity  v,  are  constant.  In 
addition,  a  function  F,  called  the  fractional  volume  of  fluid  function,  is 
introduced  for  tracking  the  water-contaminant  interface.  The  function  is 
given  as 


0 


(4) 


This  equation  states  that  F  moves  with  the  fluid.  In  a  two-fluid  flow  the 
value  of  F  is  unity  at  any  point  occupied  by  the  first  fluid  (say, 
contaminant)  and  zero  elsewhere.  When  averaged  over  the  cells  of  a 
computational  mesh,  the  average  value  of  F  in  a  cell  is  equal  to  the 
fractional  volume  of  the  cell  occupied  by  the  first  fluid.  In  particular,  a 
unity  value  of  F  corresponds  to  a  cell  full  of  the  first  fluid,  whereas  a  zero 
value  indicates  that  the  cell  contains  only  the  second  fluid  (say,  water 
including  the  jet  fluid  and  the  water  layer  of  Figure  1).  Cells  with  F  values 
between  zero  and  one  contain  an  interface,  as  illustrated  in  Figure  3.  With 
this,  the  interface  separating  the  two  fluids  can  be  tracked. 

The  velocity  components  u  and  v  in  the  momentum  Eqs.  (2)  and  (3)  have 
been  solved  by  using  the  explicit  finite  difference  scheme,  while  the  pressure 
p  has  been  computed,  coupled  with  the  continuity  equation  (Eq.  (1)),  via  an 
implicit  finite  difference  method.  The  solution  of  the  F  function  in  Eq.  (4) 
has  been  obtained  by  using  the  Donor-Accepter  flux  approximation.  Details  of 
the  solution  method  have  been  given  in  Reference  9  of  this  paper.  In  order  to 
observe  the  evolution  (location  and  shape)  of  the  region  covered  by  the 
contaminant  droplet.  Marker  Particles  have  been  embedded  in  the  fluid  and  move 
with  it,  but  do  not  affect  the  fluid  dynamics. 

Numerical  computations  have  been  carried  out  by  employing  the  SOLA-VOF 
code  (9).  In  the  current  version  of  the  code,  the  viscosities  of  both  fluids 
in  a  flow  are  considered  the  same  or  simply  zero.  To  adapt  this  code  for 
solving  the  present  flow  which  involves  two  fluids  with  very  different 
viscosities  we  have  implemented  the  following  viscosity  relationship  into  the 
code. 


v  =  v  F  +  (1  -  F)  v  (5) 
c  w 

where  v  is  the  kinematic  viscosity  of  fluid  in  a  cell,  v  the  kinematic 
viscosity  of  the  first  fluid  (contaminant),  v  the  kinematic  viscosity  of  the 
second  fluid  (jet  fluid),  and  F  the  function  defined  in  Eq,  (4).  Similarly, 
the  density  in  a  cell  is  approximated  to  be 


p  «  p  F  +  (1  -  F)  Ptt  (6) 
c  w 

where  and  are  the  density  of  the  contaminant  and  plain  water, 
respectively.  From  Eqs.  (5)  and  (6),  we  see  that  the  values  of  v  and  p  in  a 
cell  are  functions  of  F. 
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Finally,  it  is  noted  that  the  Reynolds  numbers  based  on  the  jet  thickness 
and  the  jet  velocities  used  in  our  computations  are  in  the  range  of  20  - 
2000.  Within  this  range,  Eqs.  (2)  and  (3)  are  felt  to  be  appropriate  for  the 
present  flow  analysis,  even  though  the  equations  do  not  include  turbulence 
considerations . 

IV.  COMPUTATIONAL  RESULTS  AND  DISCUSSIONS.  The  following  are  the  input 
data  for  our  computations: 


0 


incidence  angle  of  the  jet  =  19.8°  -  90° 


Dj 


diameter  of  the  jet  (thickness  of  the  jet  in 
the  present  two-  dimensional  model)  =1.83  mm 


v 

w 


v 

c 


=  jet  velocity,  uniform  across  the  jet  =  5-10 

m/sec 

-  density  of  plain  water  =  0.001  Kg/cm 

=  density  of  the  contaminant  =  0.00107  Kg/cm^ 

=  Kinematic  viscosity  of  water  =  0.0098  cm  /sec 

=  kinematic  viscosity  of  the  contaminant  =  0.098  -  9.8 
cm^/sec 

Droplet  size:  3  mm  x  0.6  mm  (length  x  height) 


Computed  results  are  presented  as  follows: 

A.  Flow  Patterns ♦  Figure  4  presents  two  typical  flow  developments 
following  the  commencement  of  the  jet  flow  up  to  0.2  millisecond.  They 
correspond  to  the  contaminant  viscosities  v  =  0.098  cm^/sec  (=  10  y  )  and 
v  =  9.8  cm^/sec  (=  1000  v  ),  respectively.  It  is  noted  that  all  of  the 

graphs  in  the  figure  have  been  magnified  by  a  factor  of  3  in  the  vertical 
direction  in  order  to  provide  a  better  flow  visualization  near  the  bottom  wall 
of  the  channel.  The  vectors  indicated  in  the  flow  region  represent  the  local 
fluid  velocities  in  the  individual  cells  of  the  computational  mesh.  In  the 
low  viscosity  case  the  jet  stream  toward  the  droplet  is  very  much  parallel  to 
the  bottom  wall,  while  in  the  high  viscosity  case  the  jet  stream  is  lifted  off 
the  wall.  It  is  interesting  to  note  that  as  the  time  progresses,  part  of  the 
fluid  near  the  downstream  edge  of  the  droplet,  especially  in  the  high 
viscosity  case,  moves  toward  the  bottom  wall  behind  the  droplet.  Apparently, 
a  low  pressure  region  is  created  behind  the  lower  portion  of  the  droplet, 
similar  to  the  flow  region  appearing  behind  an  obstacle  which  stands  in  a 
flow.  Figure  5  presents  another  view  of  the  revolution  of  the  two  droplets. 
The  viscosity  effect  is  pronounced  as  seen  when  comparing  the  profiles  and  the 
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displacements  S  of  the  two  droplets.  We  also  observe  that  at  t  =  0,2 
millisecond  all  fluid  particles  inside  the  low  viscosity  droplet  have  been 
disturbed,  while  in  the  high  viscosity  droplet  there  is  a  region  near  the 
wall  in  which  fluid  particles  remain  in  good  order. 

B.  Optimum  Jet  Incidence  Angle.  The  incidence  angle  of  the  jet 
impingement  should  play  a  key  role  in  affecting  the  jet  performance  in 
displacing  the  contaminant  droplet.  A  small  incidence  angle  will  result  in  a 
large  portion  of  the  jet  stream  moving  toward  the  contaminant  droplet,  but  a 
small  normal  force  to  displace  the  contaminant  near  the  wall  as  a  result  of  a 
small  rate  change  of  momentum  in  the  impingement  area.  A  large  incidence 
angle,  on  the  other  hand,  will  produce  a  reverse  result.  Therefore,  there  is 
an  optimum  angle  at  which  a  jet  impingement  can  perform  best  for  removal  of 
the  contaminant. 

In  order  to  optimize  the  incidence  angle,  several  angles  ranging  from 
19.8°  to  90°  have  been  used  to  compute  the  displacement  of  the  droplet  as  a 
function  of  time  following  the  impingement  process.  At  each  angle  the  jet  is 
properly  located  such  that  the  impingement  will  need  the  shortest  time  to 
displace  the  upstream  edge  of  the  droplet  a  prescribed  distance,  say,  one 
third  of  the  original  droplet  length. 

Figures  6  and  7  show  the  results  for  the  jet  velocities  of  5  m/sec  and  10 
m/sec  respectively.  The  dashed  lines  represent  the  results  for  the  case  that 
the  contaminant  viscosity  is  equal  to  10  times  viscosity  of  plain  water 
(vc  ■  10  v*  )  and  the  solid  lines  for  v  -  1000  V  .  As  seen,  the 

W  C  W 

displacement  S  at  a  given  time  t  increases  with  the  incidence  angle  0  until 
0  reaches  56.28°.  Beyond  that  angle,  S  falls  except  in  the  case  that  =  5 
m/sec  and  =  1000  shown  in  Figure  6.  When  the  water  layer  which 

initially  covers  the  droplet  as  shown  in  Figure  3  is  increased  from  0.2  mm  to 
0*6  mm,  the  result  shown  in  Figure  8  also  indicates  that  0  =  56.28°  results  in 
a  better  jet  performance  than  any  other  smaller  angle.  An  important 
conclusion  then  can  be  drawn  that  for  a  given  jet  velocity  and  a  given  jet 
size  (i.e.,  the  cross-sectional  area  of  the  jet)  a  jet  impinging  at  an  angle 
around  56°  can  achieve  the  highest  cleaning  speed  of  the  contaminant. 

The  efficiency  of  the  use  of  jet  fluid,  as  well  as  the  cleaning  speed,  is 
of  concern  since  in  some  areas,  especially  in  the  battlefield,  the  supply  of 
the  jet  fluid  could  be  very  limited.  Thus  we  need  a  system  capable  of 
decontaminating  a  surface  area  with  the  least  consumption  of  jet  fluid. 

^  In  Figure  9,  A  denotes  the  surface  area  which  has  been  decontaminated  and 

V  the  volume  of  jet  fluid  consumed  for  the  decontamination  of  the  area.  Then 

A/V  is  called  the  nominal  area  cleaned  per  unit  volume  of  jet  fluid  consumed 

and  thus  can  represent  the  efficiency  of  the  use  of  the  jet  fluid.  The  figure 

explicitly  shows  that  a  jet  at  an  angle  of  incidence  between  45°  and  60°  from 
the  impingement  surface  will  achieve  the  most  efficient  use  of  jet  fluid. 
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C.  Improvement  of  Jet  Performance  by  Increasing  the  Jet  Velocity  or 
the  Jet  Size.  The  jet  velocity  and  its  cross-sectional  area  are  another  two 
important  flow  parameters  governing  the  performance  of  the  jet  impingement  for 
decontamination  applications. 


In  _  Figure  10  a  comparison  between  curve 
curve  "a  M  and  curve  “b 


and  curve 


or  between 


indicates  that  an  increase  in  the  jet  velocity 
from  7.27  m/sec  to  10  m/sec  accelerates  the  cleaning  speed  substantially. 
Similar  results  can  be  obtained  when  the  jet  size  Dj  (i.e.,  the  thickness  of 
the  jet  in  the  two-dimensional  case)  is  enlarged  as  ^seen  from  a  comparison 
between  curve  "a  "  and  curve  "b  "  or  between  curve  "a  "  and  curve  4,b  ".  A 

question  then  arises:  Which  is  the  better  means  to  improve  the  cleaning 

speed  provided  that  the  jet  flow  rate  (Q  =  Vj  )  is  the  same,  by  increasing 
the  jet  velocity  or  increasing  the  jet  size*  J  For  the  answer,  we  first  compare 
curve  "a"  with  curve  "a  As  listed  at  the  right  corner  of  the  figure,  both 
curves  correspond  to  the  same  jet  flow  rate  Q  -  0.0183  nfVsec,  however,  curve 
"a"  resulted  from  a  jet  velocity  twice  that  of  curve  "a  "  but  from  a  jet  size 
half  that  of  curve  **a  " •  The  comparision  between  them  reveals  that  the  one 
(curve  "a")  resulted  from  a  higher  jet  velocity  provides  a  highej  cleaning 
speed.  This  is  also  true  when_we  compare  curv^  "a  "  with  curve  "a  ",  curve 


"b"  with  curve  "b 


or  curve 


with  curve  b 


Next  we  examine  the  increases  of  the  jet  velocity  and  the  jet  size  in 
relation  to  the  efficiency  of  the  use  of  jet  fluid.  In  Figure  11  the  upper 
curve  of  each  set  of  the  curves  is  the  result  from  an  Increase  in  jet  velocity 
Vj  from  5  m/sec  to  10  m/sec,  while  the  jet  size  is  held  constant,  Dj  =  d  = 
1.83  mm.  The  lower  curve  of  each  set  is  the  result  from  an  increase  in  jet 
size  from  d  to  2d,  while  the  jet  velocity  remains  the  same,  Vj  =  5  m/sec.  We 
see  that  the  efficiency  of  the  use  of  jet  fluid,  represented  by  A/V  , 
increases  with  the  increasing  jet  velocity,  but  decreases  with  the  increasing 
jet  size.  The  figure  shows  that  at  a  given  jet  flow  rate  0  »  the  jet  with  an 
increased  jet  velocity  performs  more  efficiently  in  terms  of  consuming  less 
jet  fluid  to  decontaminate  a  given  area. 


As  a  remark,  the  above  results  obtained  from  the  present  two-dimensional 
(2D)  flow  model  are  also  applicable  to  the  three-dimensional  (3D) 
impingement*  Unlike  in  the  2D  impingement  in  which  most  of  the  jet  fluid  from 
the  increased  jet  size  flows  toward  the  contaminant,  only  a  small  amount  of  it 
moves  toward  the  contaminant  in  the  3D  case.  Accordingly,  in  the  3D 
impingement  an  increase  in  the  jet  size  is  less  effective  than  in  the  jet 
velocity  in  improving  the  jet  performance.  In  fact,  when  a  number  of  3D  jets 
are  arrayed  in  a  line,  they  resemble  a  2D  jet. 

V.  Summary  and  Conclusion.  A  two-dimensional,  two-fluid  model  has  been 
developed  for  characterization  of  the  jet-contaminant  interaction  flow  on  a 
plane  wall  which  occurs  in  chemical  decontamination  processes.  Computer 
graphs  are  presented  to  show  typical  flow  developments  following  the 
impingement  process.  The  contaminant  viscosity,  the  jet  incidence  angle,  the 
jet  velocity,  and  the  jet  size  have  been  demonstrated  to  be  important  flow 
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parameters  in  governing  the  flow  pattern  and  the  performance  of  the  jet 
impingement  for  decontamination  application. 

The  computer  graphs  show  that  in  the  high  viscosity  contaminant  case  a 
thick  viscous  layer  develops  quickly  inside  the  flow  region  covered  by  the 
contaminant  droplet  and  the  jet  stream  is  lifted  off  the  impingement 
surface.  From  the  computed  results  we  have  found  that  the  optimum  jet 
incidence  angle  for  achieving  the  highest  cleaning  speed  of  the  contaminant 
and  the  most  efficient  use  of  the  jet  fluid  is  in  the  range  of  45°  -  60°.  An 
increase  either  in  the  jet  velocity  or  in  the  jet  size  can  greatly  improve  the 
cleaning  speed*  However,  if  the  jet  flow  rate  is  the  same,  an  increase  in  the 
jet  velocity  rather  than  the  jet  size  will  raise  the  efficiency  of  the  use  of 
jet  fluid. 
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Figure  1.  Flow  configuration  before 
impingement 
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Figure  2.  Model  for  numerical  computation 
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Figure  4.  Flow  patterns  (two-fluid  flow,  =  0.0098  cm. /sec, 
Vj  =  5  m/sec,  0  =  45°) 


DROPLET  UPSTREAM  EDGE  DISPLACEMENT,  S  mm 


Figure  6.  Droplet  upstream  edge  displacement,  S, 
vs.  time  after  commencement  of  jet  flow,  t,  for 
various  jet  incidence  angles,  0  (jet  velocity  = 
5  m/sec) 
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TIME,  t  MS 


Figure  7.  Droplet  upstream  edge  displacement, 

S,  vs*  time  after  commencement  of  jet  flow,  t, 
for  various  jet  incidence  angles,  0  (jet  velocity 

V.  =  10  m/sec) 

1 
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Figure  8.  Droplet  upstream  edge  displacement,  S, 
vs.  time  after  commencement  of  jet  flow,  t  (water 
layer  thickness  is  increased  from  0.2  mm  to  0.6  mm) 
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JET  INCIDENCE  ANGLE,  0  degrees 


Figure  9.  Nominal  area  cleaned  per  unit 
volume  of  jet  fluid  consumed,  A/V*,  for  various 
jet  incidence  anyles,  0 
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Figure  10.  Droplet  upstream  edge  displacement,  S, 
vs.  time  after  commencement  of  jet  flow,  t,  for 
various  combinations  of  jet  velocity  and  jet  size 
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Figure  11.  Nominal  area  cleaned  per  unit  vlume  of  jet 
fluid  consumed,  A/V*,  for  various  jet  flow  rate,  6 
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High  viscosity  liquids  hate  to  work .  Low  viscosity  liquids 
are  the  victims  of  the  laziness  of  high  viscosity  liquids  because 
they  are  easy  to  gush  around . 


ABSTRACT.  The  arrangement  of  components  in  steady  flow  of 
immiscible  liquids  is  typically  nonunique.  The  problem  of  selection 
of  arrangements  is  defined  here  and  is  studied  by  variational 
methods  under  the  hypothesis  that  the  realized  arrangements  are  the 
ones  which  maximize  the  speed  on  exterior  boundaries  for  pre¬ 
scribed  boundary  tractions,  or  the  ones  which  minimize  the . tractions 
for  prescribed  speeds.  The  arrangements  which  minimize  tractions 
also  minimize  the  dissipation  by  putting  low  viscosity  liquid 
in  regions  of  high  shear.  The  variational  problem  is  used  as  a 
guide  to  intuition  in  the  design  and  interpretation  of  experiments 
when  results  of  analysis  of  stability'  are  unavailable.  In  fact  we 
always  observe  some  kind  of  shielding  of  high  viscosity  liquid. 

This  can  occur  by  sheet  coating  in  which  low  viscosity  liquid 
encapsulates  high  viscosity  liquid,  or  through  the  formation 
of  rigidly  rotating  masses  of  high  viscosity  liquid  which  we  call 
rollers.  In  other  cases  we  get  emulsions  of  low  viscosity  liquid 
in  a  high  viscosity  foam.  The  emulsions  arise  from  a  fingering 
instability.  The  low  viscosity  liquid  fingers  into  the  high 
viscosity  liquid  and  then  low  viscosity  bubbles  are  pinched  off 
the  fingers.  The  emulsions  seem  to  have  a  very  low  effective 
viscosity  and  they  shield  the  high  viscosity  liquid  from  shearing. 

In  the  problem  of  Taylor  instability  with  two  fluids  low  viscosity 
Taylor  cells  are  separated  by  stable  high  viscosity  rollers, 

1.  INTRODUCTION.  We  are  interested  in  the  flow  of  two 
immiscible  liquids  separated  by  an  interface,  driven  by  prescribed 
forces  of  the  usual  type.  We  call  such  motions  bicomponent  or  two 
phase  flows.  In  fact,  we  do  not  consider  two  phases  of  the  same 
material,  but  of  separate  liquids  which  do  not  mix  (oil  and  water, 
for  example).  To  fix  our  problem,  we  specify  the  total  volume  of, 
the  two  fluids  and  the  individual  volumes  occupied  by  each  one  of 
them.  Then  our  problem  is  to  describe  the  motion  and  the  spatial 
arrangement  of  each  component.  There  is  a  high  degree  of  nonuniqueness 
in  such  problems,  even  when  the  motion  is  steady  and  even  when  the 
region  of  flow  is  bounded  and  the  Reynolds  numbers  for  each  of  the 
fluids  is  very  small.  In  some  cases  we  nay  find  a  class  of  steady 
motions:  no  motions,  Couette  motions,  Poiseuille  motions,  etc., 
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which  are  uniquely  determined  by  the  data  up  to  an  arbitrary 
rearrangement  of  components  (phases) .  The  arrangement  of  components 
which  is  actually  achieved  in  the  flow  is  certainly  connected  to  the 
problem  of  hydrodynamic  stability*  However,  it  has  been  suggested 
that  in  some  problems  the  configurations  which  are  achieved  are 
those  which  in  some  sense  extremalize  the  viscous  dissipation.  In 
fact  a  more  precise  statement  of  this  apparently  mystical  idea 
can  be  formulated  in  terms  of  fluxes  and  forces.  The  idea  is  to 
maximize  the  flux  for  a  given  force  (or  to  minimize  the  force 
for  a  given  flux)  over  an  admissible  class  of  phase  arrangements  in 
a  set  of  nonunique  steady  solutions.  Our  experiments  show  that 
something  like  this  is  going  on.  The  arrangements  of  the  components 
do  in  fact  appear  to  be  ones  which  extremalize  in  some  mathematical 
sense.  The  extremalizing  configurations  are  such  as  to  minimize 
the  shearing  of  high  viscosity  liquids  by  the  spontaneous  migration 
of  low  viscosity  liquids  into  regions  where  the  shearing  is  greatest. 

The  experiments  reported  in  this  paper  seem  to  be  of  extra¬ 
ordinary  interest  in  that  they  exhibit  previously  unknown  types 
of  fluid  dynamics  which  may  be  typical  for  flows  of  immiscible  liquids. 
Our  experiments  are  visual  and  qualitative.  In  the  future  it  would 
be  good  to  systematically  monitor  torques  and  speeds  and  to  under¬ 
take  to  correlate  observations  with  systematic  variations  of  geo¬ 
metric  parameters. 

2.  EQUATIONS  OF  MOTION.  The  equations  of  motion  in  each 
liquid  are  the  usual  ones.  We  are  going  to  study  Navier-Stokes 
fluids  in  this  paper  but  we  are  not  yet  certain  that  Navier-Stokes 
dynamics  suffices  to  explain  all  that  we  have' observed.  The  stress  T 
is  given  by 


(2,1)  T  =  -p  1  +  S,  S  =  2-jD[u] 

where  D  is  h  of  the  symmetric  part  of  the  velocity  gradient  vu 
and  for  all  fluids  in  all  regions  of  flow  — 


(2.2)  div  u£  =  0,  1,2  , 

C2-3)  0-^  U„  =  -  VV  div  SA  ,  St  «  D[u£]  , 

(2.4)  *£  =  P*g  Z .+  p£ 

where  g  is  gravity,  Z  increases  against  g,  p1  and  p2  are  the  densities 

of  the  first  and  second  fluids  and  and  are  the  viscosities. 

The  interface  between  the  regions  1  and  2  is  called  I  and  is 
given  by 

(2.5)  f  (x(t)  ,t)  =  0 
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and,  since  this  is  an  identity  in  t 


(2.6) 


u  *  Vf 


0 


where  we  have  assumed  that  the  normal  component  of  velocity  dx/dt  of 
the  surface  Z  and  the  particles  of  fluid  on  either  side  of  Z  are 
the  same.  In  fact,  the  velocity  u  is  continuous  across  Z 


(2.7)  lu-1  =  ux  -  u2  =  0. 


Now  we  state  the  conditions  of  continuity  for  the  stresses 
across  I.  Let  n  =  Vf/ | Vf |  be  the  normal  to  Z  and 
t.,  be  two  orthonormal  vectors  in  Z-  The  jump  across  I  of 

the  traction  [T]  •  n  satisfies 

(2.8)  IT  I  •  n  +  VjjC  +  2Han  ~  0 

where  VTT  is  a  surface  gradient,  2H  is  the  sum  of  the  principal 
curvatufes  and  a  is  surface  tension.  Now,  using  (2.4)  we  introduce 
the  head  <t, 

(2.9)  [Til  =  -  tpll+  IS  U  -  g  loj  55e1+  USD 


where  Z£  is  the  value  of  the  coordinate  Z  on  Z.  We  project  (2.8), 
using  (2,9),  with  n  and  t^,  to  find  that 

(2.10)  - 1  $j  +  n  -  tsi  *  n  +  g  Z^IpI  +  2Ha  =  0  , 

(2.11)  T^HSJ  •  n  +  t %  •  Vzla  »  0  ,  A-l,  2  . 

This  statement  of  conditions  is  completed  by  stating  boundary  and 
other  auxilliary  conditions.  In  general  the  position  of  the 
interface,  f(x,t)  »  0  is  an  unknown,  to-be-determined  quantity. 

In  many  cases,  even  after  specifying  the  volumes  occupied  by  each 
fluid,  the  solutions  of  the  equations  are  not  uniquely  determined  by 
the  boundary  data.  In  fact,  there  is  a  very  high  degree  of  non¬ 
uniqueness  which  we  shall  now  specify  more  precisely. 
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3.  HYDROSTATIC  CONFIGURATIONS  WHICH  ARE  NOT  UNIQUE .  Immiscible 
liquids  of  the  same  density  will  form  spheres  of  one  liquid  in  another. 
The  steady  distribution  of  such  spheres,  their  sizes  and  their 
placement,  seems  not  to  be  unique.  In  fact  Plateau  (1873)  suspended 
masses  of  olive  oil  in  a  mixture  of  alcohol  (lighter  than  oil)  and 
water  (heavier  than  oil)  of  the  same  density.  By  the  latter 
device  spheres  of  many  centimeters  were  obtained.  It  is  possible 
to  have  big  spheres  of  oil  in  alcohol-water  and  big  spheres  of 
alcohol-water  in  oil.  For  such  spheres  we  satisfy  the  equations  of 
§2  with 


u  —  _S  -  V  a  =  ([pi  =  0 


and 


2  Ha  =  DL$J  =  tpl  =  -g- 


where  R  is  the  radius  of  each  sphere.  There  is  nothing  here  to 
determine  the  size  and  placement  of  spheres  from  given  data. 

The  type  of  stationary  configuration  with  bubbles  of  different 
sizes  in  immiscible  fluids  of  matched  density  which  was  studied  by 
Plateau  is  shown  in  Plate  3,1,  This  plate  shows  the  result  of 
matching  the  density  of  dibutyl  phthalate  with  a  glycerine  (heavier) 
and  water  (lighter)  solution. 

The  nonuniqu-e  stationary  configurations  which  are  achieved  by 
matching  density  appear  to  be  stable  to  small  disturbances.  However, 
there  is  a  tendency  for  bubbles  which  touch  to  collapse  into  one 
bubble.  It  may  be  true  that  there  is  a  selection  mechanism  based  on 
stability  to  large  disturbances  in  which  the  stable  configuration 
is  the  one  which  minimizes  surface  area.  This  type  of  criterion 
leads  to  large  bubbles,  even  one  large  one,  rather  than  many  small 
ones. 

4,  NONUNIQUE  PLANE  COUETTE  FLOW  IN  LAYERS.  Another  case  of 
nonuniqueness  of  steady  flow  of  two  immiscible  fluids  is  plane 
Couette  flow  in  layers.  Such  flows  are  like  heat  conduction  in  layered 
materials  and  are  characterized  by  alternate  layers  of  fluids  whose 
speed  and  shear  stress  is  matched  at  each  interface.  The  flowing 
fluid  with  velocity  u  -  e^  u^(y),  density  and  viscosity  occupies 

a  height  per  unit  area  in  the  plane  of  flow,  the  second  fluid 
occupies  a  height  &2  Per  unit  area  where  33  &  is  the  total 

height  of  the  layers  and  the  two  fluids  may  have  any  number  of  con¬ 
tiguous  layers  as  in  Fig.  4.1.  The  velocity  functions 

(4.1)  uly)  =  ay  +  b 


are  linear  in  each  layer  and  satisfy  the  conditions 
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The  normal  stress  balance  is  automatically  satisfied  by  the  fields 
(4.1),  on  flat  interfaces,  whether  the  surface  tension  is  small  or 
large.  It  matters  not  whether  the  densities  are  the  same  or 
different.  If  they  are  different  then  the  steady  layered  flow 
exists,  but  is  unstable,  with  all  the  heavy  stuff  falling  to  the 
bottom.  If  the  densities  are  the  same,  no  one  knows  what  is  the 
preferred  arrangement  of  the  layers.  In  fact  Yih  (1967)  has 
shown  that  many  of  the  configurations  of  flow  with  two  layers 
are  stable  to  small  disturbances  in  the  form  of  long  waves  and 
many  are  unstable  to  the  same  type  of  disturbance  as  the  Reynolds 
number  tends  to  zero.  Stability  depends  on  the  viscosity  ratio  and 
the  volume  ratio.  So  non-uniqueness  of  layered  Couette  flow  .remains 
even  after  analysis  of  stability  eliminates  some  steady  arrangements. 
In  the  case  of  instability,  Yih  says  that  the  interface  becomes 
wavy.  It  would  be  interesting  to  study  the  bifurcation  of  layered 
Couette  flow  with  flat  interfaces  into  shear  flow  with  wavy  inter¬ 
faces.  If  such  solutions  do  bifurcate  there  would  be  a  yet  greater 
degree  of  nonuniqueness. 


Fig.  4,1:  Layered  Couette  flow.  The  high  viscosity  fluid  is  the 
one  with  viscosity  y2  >  y^.  The  two  layers  with  fluid 

2  have  a  total  height  equal  to  anc^  the  two  layers  with 

fluid  1  have  a  total  height  of  SL^f  ^  ^  =  S'~  The 

velocity  profiles  are  linear  functions  of  y,  which  in¬ 
crease  in  a  continuous  way  from  u(0)  -  0  to  u  ( JL)  =  U. 
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5,  NQNUNIQUE  PLANE  POISEUILLE  FLOW  IN  LAYERS .  Another  case 

of  two  immiscible  fluids  is  plane 
set-up  is  the  same  as  in  §4,  but 
pressure  gradient 


top  wall.  Hence,  in  each  layer 


=  o  and  u(0)  =  u(Ji)  =  0. 
solutions  in  which  the  fluid  with 
viscosity  \i ^  is  in  N  layers  of  total  height  separated  by  layers 

of  fluid  with  viscosity  y2  whose  total  height  is  •  The  heights 

of  the  constituent  layers  and  their  number  are  otherwise  arbitrary. 

The  remarks  made  about  density  differences  in  §4  apply  in  full 
force  here* 

Yih  (1967)  has  also  considered  the  stability  of  such  flow 
of  two  fluids  with  different  viscosities  in  two  layers  separated 
by  one  interface.  He  finds  that  Poiseuille  flow  in  two  layers  is 
always  unstable  to  long  waves  even  for  small  Reynolds  numbers 
tending  to  zero.  Yih  does  not  raise  the  problem  of  preferred 
arrangement  of  constituents.  In  fact  it  is  probable  that  among  the 
continuum  of  layered  Poiseuille  flows  the  one  with  the  greatest 
stability  is  that  for  which  the  high  viscosity  fluid  is  centrally 
located  (see  Fig.  5.1).  This  configuration  maximizes  the  mass  flux 
for  a  given  G,  Z^  and  requires  the  study  of  flows  with  at  least 

three  layers,  not  admitted  into  the  analysis  of  Yih.  The  variational 
problem  associated  with  the  problem  of  the  preferred  arrangement 
of  constituents  is  formulated  in  §11,  and  a  solution  of  this  problem 
for  bicomponent  Poiseuille  flow  in  pipes  is  given  in  a  paper  by 
Joseph,  Renardy  and  Renardy  (1983)  which  is  a  companion  to  this  one. 

We  shall  refer  to  this  companion  paper  as  JRR(1983). 

6.  NONUNIQUE  POISEUILLE  FLOW  IN  PIPES  OF  ARBITRARY  CROSS-SECTION. 
Another- case"  of  nonuniqueness  of  steady  flow  of  two  immiscible 
fluids  is  Poiseuille  flow  in  pipes  of  arbitrary  cross-section.  We 
shall  imagine  that  the  fluids  have  the  same  density  but  different 
viscosities.  The  cross-section  of  the  pipe  is  called  Q.  and  the 
coordinates  in  Q  are  (y,z)  with  x  increasing  along  generators  of  the 
pipes.  The  total  cross-sectional  area  occupied  by  each  of  the 
two  components  ^  and  +  ^1  =  anc^  t*ie  Pressure  gradient 

(-G)  are  prescribed.  We  may  find  a  continuum  of  phase  arrangements 
satisfying 


of  nonuniqueness  of  steady  flow 
Poiseuille  flow  in  layers.  The 
u  (y)  is  generated  by  a  constant 


P-  =  -G 
dx 


instead  of  by  shearing  from  the 


(5.1) 

Ay 


where,  at  each  interface  lul  = 
This  problem  has  a  continuum  of 
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Fig.  5.1:  Layered  plane  Poiseuille  flow  in  three  layers. 
The  configurations  in  (a)  and  < b)  are  the  centrally  located 
ones.  The  configuration  in  (a)  maximizes  the  mass.  flux 
for  a  given  pressure  gradient  ( — G)  among  the  continuum  of 
contiguous  layers  of  high  viscosity  (vu)  layers  of  total 
height  and  l°w  viscosity  (u^)  layers'  of  total  height  l 


u 


=  sx  u(y,  z) , 


*2u. 


8y2 


a2 
d  u , 


V 


U(Y;  Z)  ~  0, 

1 


=  ~G  ;  1=1,  2  . 


At  each  interface  we  require  that  the  velocity  and  the  shear  stress 
be  continuous ,  The  prescription  of  the  shape  and  placement  of  the 
interfaces  is  left  completely  arbitrary,  subject  to  the  constraint 
of  prescribed  total  area.  The  jump  of  the  normal  stress  across 
interfaces  vanishes  automatically  for  solutions  of  Poiseuille  type 
when  the  surface  tension  G  —  0 .  If  o’  ^  0,  the  pressure  will  jump 
across  each  interface,  and  the  value  of  the  jump  will  be  independent 
of  (y,z),  and  balanced  by  a  constant  surface  tension  force;  that 
is  by  constant  interface  curvatures.  Hence,  when  a  ^  0,  the 
allowed  interfaces  are  circles,  or  circular  arcs  terminating  on 
boundaries.  The  number  of  such  arcs  and  their  placements  are 
arbitrary. 

There  are  experiments  which  suggest  that  in  the  flow  of  two 
fluids  in  a  pipe  there  is  a  tendency  for  the  low  viscosity  fluid 
to  encapsulate  the  high  viscosity  one.  The  flows  are  such  as  to 
put  the  high  viscosity  component  in  the  center  of  the  pipe  where 
the  shears  are  smallest  (see  §10  and  JRR(1983)). 

7.  NQNUNIQUECQUETTE  FLOW  IN  CIRCULAR  RINGS  BETWEEN  ROTATING 
CYLINDERS .  Another7  case' of  nonuniqueness  of- steady  flow  of  two 
immiscible  fluids  is  Couette  flow  in  circular  rings  between  rotating 
cylinders.  We  shall  imagine  that  the  fluids  have  the  same  density 
but  different  viscosities.  We  look  for  solutions  in  which  the 
liquids  are  arranged  in  contiguous  circular  layers  subject  to 
a  prescription  of  the  total  volume  of  each  component  liquid,  (Fig.  7.1). 
Some  solutions  of  this  type  will  be  given  in  §12.  We  look  for 
solutions  in  each  layer  in  the  form 

u  =  e0  v (r) 


where  the  speeds  of  the  inner  (r-a)  and  outer  (r=b)  cylinders 
are  prescribed 


v  (a)  =  a, 


v  ( b)  =  ft2b  . 
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The  v(r)  that  is  required  by  the  Navier-Stokes  equations  in  each 
layer  is  of  the  form 


v  ( r) 


Ar  + 


B 

r 


The  number  and  thickness  of  such  circles  are  arbitrary,  with  the 
fixed  volume  constraint,  and  we  can  choose  sets  of  {A}  and  {B}  so 
that  at  each  interface 


tv] 


tv 


d  (v/r)  ^ 
dr  J 


0. 


The.  normal  stress  equation  may  be  satisfied  by  balancing  pressure 
jumps  against  surface  tension- 


(a)  (b) 


Fig.  7.1:  Couette  flow  in  circles  of  immiscible  liquids  of 

the  same  density  but  different  viscosity:  (a)  two 
layers,  (b)  three  layers  with  the  same  total  volume  of 
y^  and  The  number  and  thickness  of  these  layers  are 

arbitrary. 


8.  MORE  GENERAL  EXAMPLES  OF  NONUNIQUENESS  IN  THE  STEADY  FLOW 
OF  IMMISCIBLE  LIQUIDS"!  Nonunique  arrangements  of  components  in 
bicomponent  flows  appear  to  be  a  general  property  going  far  beyond 
the  simple  examples  exhibited  in  the  previous  sections. 

For  example  we  could  generate  nonunique  two  dimensional 
"Poiseuille"  flows  by,  say,  a  wavy  perturbation  of  the  solid  boundary. 
We  should  then  of  course  be  obliged  to  show  how  the  hydrodynamics 
and  interfaces  perturb  under  the  boundary  perturbation. 
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As  another  example  we  may  consider  how  the  circular  flows  in 
Fig.  7.1  perturb  when  there  is  a  small  density  difference  H  p H  =  e  ^  0, 
We  expect  to  see  slightly  displaced  circles. 

As  another  example  consider  the  problem  of  one  fluid  displacing 
another  in  a  pipe  (see  Fig.  8.1).  This  problem  is  a  model  for 
studies  of  the  motion  of  the  contact  line.  If  there  are  such 
solutions,  they  are  not  unique  because  there  are  also  at  least 
layered  Poiseuille  flows.  If  <  y2  the  less  viscous  fluid 

will  finger  into  the  more  viscous  fluid.  In  this  case,  and  also  when 
pi 2  <  the  configuration  shown  in  Fig.  8,1  is  probably  unstable. 

Another  example,  due  to  F.  Busse  (198  2),  is  described  by  him 
as  follows:  "Convection  in  a  fluid  layer  heated  from  below  with 
two  immiscible  components  A  and  B  of  the  same  average  density  exhibits 
different  states  of  motion.  Besides  a  solution  describing  convection 
in  sublayers  other  solutions  in  which  fluid  B  is  surrounded  by 
streamlines  of  fluid  A  or  vice  versa  are  possible." 

In  our  experiments,  we  see  many  different  configurations. 

There  are  persistent  solutions  with  bubbles  of  low  viscosity  fluid 
in  high  viscosity  foams,  which  are  steady  in  some  average  sense. 

We  conclude  that,  unlike  single  component  flow,  there  is  a 
pervasive  lack  of  uniqueness  in  the  flow  of  immiscible  liquids  even 
when  the  Reynolds  number  is  small  and  even  when  the  region  of  flow 
is  bounded. 


Fig,  8.1:  One  fluid  displaces  another  in  a  horizontal  pipe. 
This  configuration  is  probably  unstable. 


9,  PHYSICAL  MECHANISMS  FOR  SELECTING  THE  ARRANGEMENT  OF 
COMPONENTS  PREFERRED  IN  THE  FLOW  OF  IMMISCIBLE  LIQUIDS.  The  natural 
thing  to  do  when  faced  with_nonuniqueness^is  to  study  stability. 

It  is  probable  that  most  of  the  possible  steady  configurations  of 
flow  are  unstable.  Our  experiments  show  that  there  is  a  definite 
tendency  for  flows  of  immiscible  liquids  to  arrange  themselves  in  such 
a  way  as  to  shield  the  high  viscosity  components  from  intense 
shearing.  We  shall  call  this  tendency  for  low  viscosity  liquids  to 
migrate  into  the  regions  of  high  shears  an  encapsulation  instability. 

A  fingering  instability  which  is  responsible  for  the  formation  of 
emulsions  is  also  important  in  the  context  of  encapsulation.  These 
types  of  instability  are  important  because  they  represent  a  type  of 
self -lubrication  principle. 
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There  is  an  irresistible  temptation  to  invoke  variational 
principles  for  selection  criteria  for  the  arrangements  of  components 
which  are  preferred  in  bicomponent  flow.  These  principles  are 
suggested  by  various  results  associated  with  extremalizing  the 
dissipation.  Such  principles,  as  a  strict  statement  of  the  under¬ 
lying  physics,  are  oversimplified.  In  fact,  exact  variational 
principles  are  ambiguous.  One  can  state  different  principles 
leading  to  different  selection  rules.  We  are  going  to  postulate 
some  such  "principles",  because  they  are  mathematically  interesting, 
very  suggestive  of  true  underlying  physics  and  provide  the  right 
ambience  for  the  discussion  of  this  problem.  So  in  this  case  as 
in  many  others  we  do  not  resist  irresistible  temptation.  However, 
in  JRR(1983)  a  good  variational  problem  is  stated,  solved, 
shown  to  agree  with  experiments  (the  experimental  results  that 
we  know;  there  are  surely  others)  and  to  be  in  partial  agreement 
and  partial  contradiction  with  the  analysis  of  stability. 

The  big  interest  in  variational  ideas  for  us  was  that 
it  guided  our  intuition  in  the  design  of  experiments  in  situations 
in  which  analysis  is  hard,  impossible  or  in  any  event  not  available. 
Probably  the  coarse  statement  that  the  hydrodynamics  shields  the 
high  viscosity  component  from  shearing  is  more  nearly  correct  than 
any  exact  variational  embodiment  of  it. 

10.  ENCAPSULATION  INSTABILITIES  AND  EXTREMAL  PRINCIPLES  FOR 
PIPE  FLOW.  In  pipe  flow  of  two^liquids  with  different  viscosities 
under  an  applied  pressure  drop,  the  low  viscosity  liquid  will  tend 
to  encapsulate  the  high  viscosity  liquid.  If  the  effects  of  gravity 
are  negligible,  the  phases  will  arrange  themselves  so  whatever  may 
have  been  the  initial  configuration,  the  high  viscosity  phase  will 
ultimately  be  centrally  located  (as  in  Fig.  5.1(a)).  This  property 
has  been  convincingly  demonstrated  in  experiments  with  very  viscous 
viscoelastic  liquids  (polymer  melts)  by  Southern  and  Ballman  (1973) , 
Everage  (1973) ,  Lee  and  White  (1974) ,  Williams  (1975) ,  and 
Minagawa  and  White  (1975),  as  well  as  in  the  flow  of  oil  and  water, 
in  which  the  water  migrates  to  the  pipe  wall,  forming  a  lubrication 
layer,  studied  by  Charles  and  Redberger  (1962) ,  Hasson,  Mann  and 
Nir  (1970) ,  and  Yu  and  Sparrow  (1967) . 

In  the  experiments  of  Southern  and  Ballman  (1973)  encapsulation 
of  the  type  exhibited  in  Fig.  10.1  is  documented.  Everage  (1973) 
shows  a  photograph  of  complete  encapsulation  with  a  centrally  located 
high  viscosity  nylon  completely  encapsulated  by  an  annular  ring 
of  low  viscosity  fluid. 

Theoretical  explanations  of  the  slow  envelopment  phenomenon 
have  Up  to  now  been  based  upon  extremalizing  energy  dissipation  as 
originally  suggested  by  Southern  and  Ballman  (1973).  Maclean  (1973), 
who  considered  planar  layered , flow,  and  Everage  (1973),  who  studied 
a  cylindrical  geometry,  both  invoked  a  variational  principle  to 
show  that  the  phase  configuration  with  the  high-viscosity  component 
centrally  located  is  favored  over  several  other  configurations. 

We  are  going  to  state  a  general  variational  criterion  which 
presumably  reduces  to  the  ones  first  discussed  for  special  cases.  We 
say  that  realized  flows  arrange  the  two  components  so  as.  to  maximize 
the  speeds  when  the  tractions  on  exterior  boundaries  are  fixed,  or  to 
minimize  the  tractions  when  the  speeds  are  prescribed.  For  pipe  flow 
we  arrange  the  two  flowing  components  to  maximize  the  mass  flux 
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when  the  pressure  gradient  is  prescribed  or  to  minimize  the  pressure 
gradient  when  the  mass  flux  is  prescribed.  The  extremalizing 
arrangement  is  the  one  with  high  viscosity  liquid  in  the  center 
( JRR (1983) ) .  We  could  draw  an  analogy  with  turbulent  flow  in 
which  it  is  shown  that  among  all  the  turbulent  solutions,  the 
laminar  one  maximizes  the  mass  flux  for  a  given  pressure  drop.  In 
fact  the  extremalizing  solution  is  not  always  stable.  The  results 
(JRR (1983 ) )  depend  on  the  volume  ratio  with  stable  flows  character¬ 
ized  by  narrow  layers  of  less  viscous  fluid  on  the  outside. 

If  we  suppose  that  nature* s  design  corresponds  to  man's  desire, 
we  could  hope  to  realize  the  solution  of  the  problem  of  maximizing 
the  flux  of  oil  down  a  pipe  of  fixed  radius  by  lubricating  the  pipe 
wall  with  water.  In  fact  the  solution  of  the  design  problem  is_ 
a  stable  one  according  to  the  calculations  give  in  JRR(1983).  The 
design  problem  is  as  follows: 

Water  and  oil  flow  steadily  along  a  horizontal  pipe  of  radius 
R.  The  lower  viscosity  (V^)  water  completely  encapsulates  the 

higher  viscosity  (y^)  oil,  which  flows  as  a  central  core  of  radius  a. 

The  flow  is  described  by  the  usual  equation,  with 


u2  =  0  at  r  -  R 


and 


du. 


dun 


2  dr 


=  U' 


Idr  J 


at 


a  . 


The  velocity  distributions  in  the  two  fluids  are: 


u.  — 


4p. 


a2  f  yl  -2 
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R 


2  a 


1)  +  (1  -  :~2 ) 


U  0  <  r  <  a 
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r,  2  2  y,  2  : 

*2  =  fer<G>  (~2°  (-2-  -  =• 

1  R  ^  2  a  a 


-)  ,  a  <  r  <  R, 


66 


where 


G  s  (-  a£> 


The . volume  flow  rate  of  oil  (Q^)  is  then 


^1  _  a* 
Q  R4 


V1  R2 

1  +  2(— -)  (~  -  1) 
v2  a 


where  Q  - 


R 

-^p(G)  is  the  volume  flow  rate  of  the  single  phase  (u1) 


when  it  fills  the  entire  pipe.  For  fixed  R  and  G,  the  volume  flow 
rate  Q1  has  a  maximum  when 


a 

R 


vl/v2 


and  the  maximum  value  is  then 


t) 


a2 


1,2 


max 


2<pT>-  r 

m2 


For  large  values  of  the  viscosity  ratio  as  would  occur  for 

example  in  an  oil/water  configuration,  the  maximum  volume  flow  rate 

Q1  is  obtained  when  a  s —  ,  and  is  given  by 

/2* 


I'l 

'  h  (— )  . 
2  JniB.x.  y  2 
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‘K?£,*X «  Iw, 

..it,  pigj. 

BalIman,  1973, 


>1  tr 


The  volume  flow  rate  of  the  lower  viscosity  fluid  is 


Thus  the  volume  flow  rate  Q2  required  to  maximize  is 


For  large  values  of  —  this  reduces  to 

y2 


-  1/4  ( 


or 


Q  / 


max  Q. 


Q 


max  . 


11.  EVOLUTION  OF  THE  ENERGY  AND  BALANCE  OF  POWER  OF  THE 
EXTERIOR  TRACTIONS  IN  THE  BICOMPONENT  FLOW  OF  IMMISCIBLE  LIQUIDS., 

The  following  energy  equation  due  to  E.  Dussan  V  governs  the  flow  of 
immiscible  incompressible  fluids  (see  Joseph,  1976,  for  a  full 
discussion) : 
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(11.1) 


+  C  OT/Ud* 
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+  f 
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u-  (T-n) 


9*y* 
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J  tr  T  D[u]  . 


The  terras  of  (11.1)  need  to  be  defined  and  discussed.  The 
definitions  of  volumes,  surfaces,  distinguished  directions  and 
velocities  are  defined  under  Fig.  11.1.  The  stress  T  is  given  by 
(2.1),  D  [u  ]  is  the  stretching  tensor  for  u. 


(kinetic  energy) , 


(potential  energy) , 


(surface  energy) , 


(derivative  of  surface  tension 
following  a  particle  in  the  surface) 


ct  *ud^ 


(power  or  working  of  the  contact 
line) 


u-  (T *  n) 
7>-y> 


(power  or  working  of  the  traction 
vector  T *  n  on  the  exterior 
boundary) 


f  tr  T  D [ u] 


JY 


(stress  power  or  dissipation) 


For  incompressible  Newtonian  fluids  the  dissipation  is  in  the  form 


|  2\t  Tr  D2[Uj. 
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Fiq  11.1:  Volumes,  surfaces  and  distinguished  directions 
for  two  fluids  separated  by  an  interface. 

and are  the  volumes  occupied  by  the  two  fluids, 

n  is  the  outward  normal  to  9"*^?  (i  “  i*  2)  , 

“i 

X  =  9^  is  the  interface  separating  the  two  fluids, 

n  is  the  normal  to  l  pointing  from^  toY\, 

3^/°-  d^f  U  -  E  is  the  exterior  boundary;  parts  of  this 

1  2 

boundary  may  be  made  of  rigid  solids, 
n  is  the  outward  normal  on  9^ 

9Z  =  1  fl  9^  is  the  contact  line;  l  is  an  arc  length  on  this 
line, 

t  is  the  tangent  vector  on  91, 

T  «  tAn  is  the  normal  to  9  E  on  E , 

u.  is  the  velocity  of  the  fluid  in'^Ci  =1,  2), 

U  is  the  velocity  of  a  point  o^the  contact  line- 


The  steady  flow  of  immiscible  fluids  satisfies  (11.1)  with  time 
derivatives  set  to  zero.  If  cr  =  0,  U  =  0  or  if  there  is  no  contact 
line  as  in  some  flows  of  two  fluids  in  pipes  or  between  cylinders, 
then  for  Newtonian  fluids 


(11.2) 


(T  *  n) 


One  selection  rule  which  may  be  postulated  is  as  follows. 

The  realized  placements  of^/^  and  within  *?/>  is  the  one  which 

maximizes  the  speeds  u  for  prescribed  tractions  on  the  exterior 
boundary  By9 or  minimizes  tractions  for  prescribed  speeds.  Joseph, 
Renardy  and  Renardy  (1983)  have  shown  that  this  is  a  well-defined 
problem  with  a  definite  solution  in  some  cases.  The  selection  rule 
could  be  stated  in  terms  of  fluxes  and  forces  instead  of  speeds 
and . tractions .  The  selection  rule  requires  that  we  first  specify  a 
class  of  nonunique  flows,  say  layered  Poiseuille  flows,  before  we 
seek  the  optimum  placements  of  components.  Maximizing  speeds  is 
the  same  as  maximizing  dissipation  for  prescribed  tractions. 

Minimizing  tractions  for  given  speeds  requires  that  we  minimize 
the  dissipation.  In  Poiseuille  flows  it  is  natural  to  fix  the  pres¬ 
sure  gradient  (tractions)  and  maximize  the  flux  (speeds) .  In  contrast, 
in  Couette  flows  we  specify  the  speeds  (angular  velocity)  and 
minimize  the  shear  stresses  (torques)  at  the  boundary. 

12.  _VARIAT_IONAL  PROBLEMS  FOR  COUETTE  FLOW  IN  PLANE  AND 
CIRCULAR  LAYERS  AND  FOR  LAYERED  _FOISEU ILLS  FLOW.  Consider  the  set 
of  layered  Poiseuille- f lows  discussed  in  §5.  In  Fig.  5.1  we 
exhibited  some  examples  of  three  layer  configurations.  In  general, 
the  layered  Poiseuille  flows  are  uniquely  determined  up  to 
arrangement.  This  means  the  total  number  N  of  layers  and  the  size 
of  the  layers,  subject  to  total  height  constraints,  are  left 
undetermined.  Now  we  shall  write  the  energy  balance  (11.2)  for 
layered  plane  Poiseuille  flow.  The  volume  here  is  a  plane  area 
of  channel  height  l  and  length  L  (along  the  axis  x) .  Since  u 
vanishes  on  the  solid  walls  the  integral  on  the  left  of  (11.2)  is 
over  planes  perpendicular  to  x  at  x  and  x  +  L. 


(T  *  n)  = 
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L 
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where 


l 

Q  =  |  udy  is  the  volume  flux. 
J0 


On  the  other  hand 


pD :  D 


pu’2(y)d^ 


L 
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I  yu?2(y)dy. 
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Hence 


(12, 1) 


i 

2GQ  =  I  uu'^(y)dy. 

;o 


We  next  recall  that  between  0  and  l  are  N  contiguous  layers  with 
fluids  of  different  viscosities.  We  can  suppose  that  the  layer 
nearest  the  bottom  is  occupied  by  a  fluid  of  viscosity  p1#  the 

next  layer  has  P ^ /  then  P^  again,  and  so  on.  So  besides  the  total 

number  N  of  layers  and  their  sizes,  we  need  to  know  if  p1  is  the 

larger  viscosity.  We  suppose  that  the  total  volume  (height)  of 
high  viscosoty  (P+)  fluid  is  given  as  and  is  the  volume  of  low 

viscosity  fluid  and  The  N  layers  are  divided  into 

intervals 


to  r y  cd  ] ' 


(1)  'y  (2) 


]  /  [y 


(2)  /Y (3) 
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(N-l) 
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With  G  given  we  maximize 
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(Y  1 

(12.2)  Q  =  j  ‘  u;l) (y)dy  +  j 
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(2) 


u  ^  (y)  dy  +.  .  .+ 


(1) 


’y(N-i) 


u(N) (y)dy 


where 


(12.3) 


r*(  1  (Y  ( 2 )  2 

f0UCl)(y)dy  +  P2jy(1)  U(2)'y)dy 
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Now  we  change  variables: 


U(l)  “  "  ^  V(D  ' 


*1  V) 

where 

(1 2 .  =1 )  v’  .  =  -  y  +  Y  ,  ye  [  0  /  &  1 

satisfies  (5.1),  Dju’I  =  0  at  each  interface  and  there  is  one 
Y  e  (0,5.)  such  that  v*  (n)  (Y)  =  0.  Inserting  this  change  of 

variables  into  (12.3)  we  get: 


(12.4) 

U (3)  =  " 
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We  maximize  this  by  arranging  the  layers  so  that  the  high  viscosity 
in  the  denominator  is  associated  with  the  smallest  value  of 

(y  _  y)2*  This  is  clearly  the  arrangement  of  (a)  of  Fig.  (5.1),  so 
N  -  3,  and  u1  is  the  low  viscosity  U__. 

The  same  considerations,  but  in  more  complicated  form,  enter  into 
the  rigorous  solution  of  this  problem  for  pipe  flow  given  by  JRR(1983). 

Couette  flow  differs  from  Poiseuille  flow  in  that  it  is 
perhaps  more  natural  to  prescribe  the  speed  of  exterior  boundaries 
and  to  minimize  the  torque.  We  first  note  that  for  the  Couette 
flow  between  cylinders  in  N  rings  and  layers  which  is  shown  with 
N  =  .2  and  3  in  Fig.  7 . 1  we  have 

e  -(T-c)  =  T<0X>  =  0 


so  that 


(12.7)  I  u  • (T-n) 

1  3V> 

=  2 it  aniT-(a?>}  =(r‘2  “  ni)M 

where  ft  2  anc^  are  ^he  angular  speeds  of  the  outer  and  inner 

cylinders,  and  M  =  27rrT<r<3>  (r)  is  the  torque  and  it  is  constant  for 
a  1  r  5  b.  It  is  clear  from  (11.2)  that  (ft2  -  ft^)M  is  positive  and 


v(r) e. 
3V 


(T  *  n) 
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(12.8) 


and 


r(2) 


u  =  £gV(r) ,  Ipl  =  0  and 


^(r>  1  =  ™  =  °  ^  r{1)  r(2)f 


We  want  to  choose  the  arrangements  of  layers  and  the  placement 
of  viscosities  so  as  to  minimize  |m|  when  ^2  a^d  ^  are  prescribed. 

It  will  suffice  to  solve  this  problem  for  the  two  layer  configuration 
shown  in  Fig.  7.1.  We  may  always  consider  the  problem  posed  for 
two  adjacent  layers.  We  find  that  the  minimizing  solution  has  the 
lower  viscosity  fluid  on  the  inside.  We  conclude  that  N  =  2  with 
more  viscous  fluid  on  the  outside  and  less  viscous  fluid  inside. 
Suppose  =  d  is  the  interface  between  two  layers  at 

r  =  a  and  r  =  b.  The  fluid  with  viscosity  occupies  the  region 

a  £  r  £  fluid  with  viscosity  occupies  the  region 

d  <  r  <  b.  We  find  that  in  d  <  r  <  b 
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v  =  A.r  +  B  /r  , 


A1  ~  ^2  ^  2 


£2. 
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2  2  2  2 

where  k  =  (b  -  d  )/(d  -  a  ) ,  is  positive,  we  may  without  losing 

generality,  consider  the  case  for  which  M  and  are  positive. 

It  is  immediate  that  M(k,y^,u2)  i-s  a  monotonically  increasing 

function  of  \i^r  from  zero  at  y2  “  0  to 

b2(!T.2  -  £21)2(ka2  +  b2)p1/(b2  -  a2)k:. 


It  follows  from  monotonicity  that  M  is  larger  when  y 2  >  y^ 

than  when  y2  <  y^.  So  we  minimize  the  torque  by  putting  the 

lower  viscosity  fluid  y  =  y2  on  the  inner  cylinder  at  radius  r  =  d. 

The  situation  when  one  of.  the  liquids  occupies  an  infinite  re¬ 
gion  has  to  be  treated  separately.  For  example,  when  b+<»  and  ft  +  0 
then  1 
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1^  |a  d  y1y2 
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r 


which  for  a  fixed  d  is  smaller  when  y2  >  y ^ . 

So  in  every  pair  of  layers  the  arrangement  which  minimizes  the 
torque  has  the  low  viscosity  liquid  on  the  inside.  It  follows 
that  optimal  arrangements  of  layers  for  minimum  torque  is  the  one 
with  two  layers  and  the  less  viscous  fluid  on  the  inner  cylinder. 

The  preferred  arrangement  for  the  problem  of  plane  Couette 
flow  in  which  the  velocity  is  U  at  y  =  &  and  zero  at  y  =  0  may 
be  studied  using  (11.2)  in  reduced  form 


C12.ll) 


UT 


<xy> 


ri 


U'2(v 


y)  dy. 


It  is  not  hard  to  verify  that  the  value  of  the  integral  on  the 
right  of  (12.11)  is  independent  of  the  number  and  size  of  layers  if 
the  total  volume  Z  and  l  of  the  high  and  low  viscosity  fluids  is 
prescribed.  It  follows  that  the  variational  problem  for  the  preferred 
arrangement  of  layers  in  plane  Couette  flow  has  no  solution. 
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13.  STABILITY,  Ultimately  the  only  satisfactory  theoretical 
approach  to  the  problem  of  selection  is  through  stability.  There  are 
a  few  papers  which  treat  the  problem  of  stability  of  immiscible 
liquids  with  different  viscosities.  Of  these,  the  papers  of  Yih 
(1967)  and  Joseph,  Renardy  and  Renardy  (1983)  are  most  important. 

The  problem  of  stability  is  clearly  identified  as  a  problem  of 
selection  of  stable  arrangement  of  components  in  the  latter  paper. 
Interested  readers  should  consult  these  papers  and  §10  of  this 
paper  for  more  references. 

14 .  ENCAPSULATION  INSTABILITIES .  We  say  that  a  family  of 
steady  configurations  of  components  in  the  flow  of  immiscible  liquids 
undergoes  an  encapsulation  instability  when  this  family  gives 

way  to  motions  in  which  the  high  viscosity  liquid  is  shielded  from 
shearing  by  the  low  viscosity  liquid.  This  type  of  instability 
can  be  observed  as  a  migration  of  low  viscosity  liquid  into  regions 
of  high  shear.  Sometimes  the  low  viscosity  liquid  moves  into  the 
region  of  high  shear  as  a  sheet,  and  sometimes  (for  example, 
when  the  high  viscosity  liquid  wets  the  moving  boundary)  the  low 
viscosity  liquid  fingers  into  the  high  viscosity  liquid,  droplets 
are  torn  off  the  fingers  and  move  into  the  region  of  high  shear  as 
an  emulsion  of  droplets  of  low  viscosity  liquid  in  a  high  viscosity 
foam. 

The  series  of  experiments  described  below  were  motivated  by 
observations  of  encapsulation  instabilities  in  pipes.  We  wanted 
to  know  if  the  theoretical  explanation  of  the  observations  which 
involved  extremalizing  dissipation  could  be  defended  and  extended. 

We  reasoned  that  minimum  dissipation  would  put  low  viscosity  liquid 
on  a  rotating  rod,  in  the  region  where  there  is  the  greatest  shear, 
as  a  kind  of  lubrication  bonanza.  Though  our  original  idea  was 
known  by  us  to  be  oversimplified  it  was  definitely  useful  as  a 
guide  to  intuition  and  interpretation. 

The  experiments  were  carried  out  in  two  rectangular  plexiglass 
boxes  shown  in  Fig.  14,1.  A  box  is  filled  with  the  heavier  of  two 
liquids  up  to  the  central  diameter  of  the  rod  and  the  lighter 
liquid  is  floated  on  the  top,  as  in  Fig.  14.2.  Since 
two  liquids  are  always  used  the  density  differences  are  not  vast. 

The  rod  is  set  into  steady  rotation.  We  want  to  know  which  liquid 
will  coat  the  rod.  The  notion  that  the  hydrodynamics  will  develop 
so  as  to  minimize  the  torque,  for  a  given  speed,  implies  that  low 
viscosity  fluid  will  coat  as  in  Fig.  14.3. 

The  experiments  in  which  we  achieved  sheet  coating  of  the  type 
described  by  Fig.  14.3  are  summarized  in  Table  14,1.  Photographs 
of  5  entries  from  Table  14.1  are  shown  in  Plates  14,1  -  14.5. 

15.  ROLLERS.  There  are  very  viscous  oils  which  are  extremely 
sticky  in  the  sense  that  they  strongly  adhere  to  certain  solid 
surfaces.  Polymeric  oils  are  sticky  in  this  sense.  High  viscosity 
(y  =  950  poise)  silicone  oils  and  STP  (y  =  110  poise)  were  sticky 
to  aluminum  and  plexiglass  in  this  sense.  When  the  oils  were 
floated  on  water  the  contact  angle  showed  that  aluminum  favored 
water  over  STP  (see  Plate  15.1(a))  or  silicone  oil  (water  wets  the 
rod) .  However,  when  the  rod  is  turned  on,  it  is  the  oil  that  coats, 
and  in  copious  quantities.  This  is  in  apparent  contradiction  of  the 
dissipation  principle  which,  thinking  superficially,  would  put 
water  on  the  rod.  So  we  have  the  impression  that  contact  angles  do 
not  tell  the  whole  story  about  stickiness. 


79 


Liquid  1  Liquid  2  ^^3  PvC£^  ct:e^ 

(Centipoise)(Csntipoise)'gin/cm3)  (gm/cin  )  BV: 


silicone  I 

glycerine 

19 

1761 

0,96 

oil 

silicone  I 

water 

19 

1 

0.96 

1.0 

oil 

light  machine  oil 

glycerine 

6.36 

1761 

0.331 

1.25 

oil 

light  machine  oil 

corn  syrup 

6.36 

14 

0.831 

1.20 

oil 

silicone  I 

corn  syrup 

19 

12 

.96 

1.20 

oil 

veg.  oil 

glycerine 

60 

1761 

0.92 

1.25  1 

oil 

00 

o  castor  oil 

glycerine 

700 

1761 

0.96 

1.25 

oil 

water 

dibutyl 

phthalate 

1 

18-19 

1 

1.045 

dibutyl  phthalate 

2%  polyacrylamide 

dibutyl 

phthalate 

"700 

18-19 

1.02 

1.045 

dibutyl  phthalate 
(sheet  coating  at  hiah  speed) 

castor  oil 

1%  polyacrylamide 

700 

90 

0.96 

oil 

(sheet  coating  at  high  speed) 

_ silicone  III _ 

_ water _ 

1 

0.95 

water 

Table  14.1 


Experiments  in  which  the  low  viscosity  fluid  coats  in  sheets. 


Fig.  14.1:  Sketch  of  plexiglass  boxes  used  in  the 

experiments.  A  rod  is  inserted  through  the 
long  planar  sides  of  the  box  and  is  attached 
to  a  variable  speed  motor.  The  [length,  height, 
depth,  rod  diameter,  rod  composition]  of  the 
two  boxes,  I  and  II,  respectively  are: 

[20.32  cm,  22.86  cm,  10,16  cm,  5.08  cm,  plexiglass] 

for  I  and 

[20.32  cm,  11.43  cm,  7.62  cm,  2.54  cm,  aluminum] 
for  II. 

The  red  rotates  counter-clockwise  in  box  I  and 
clockwise  in  box  II.  The  experiments  in  Plates 
14.1  -  14.4,  15.3,  16.1,  16.2  and  17.1  are 
carried  out  in  box  I  and  the  experiments  in  Plates 
14.5,  14.6,  15.1,  15.2,  16.3  and  16.4  are  carried 
out  in  box  II. 
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The  plexiglass  box  is  loaded  with  two  liquids 
with  the  undisturbed  interface  at  the  level  of 
the  horizontal  diameter  of  the  rod. 


The  rod  rotates  and  the  less  viscous  liquid 
coats  the  rod.  This  kind  of  coating  is  called 
sheet  coating.  -We  say  that  the  low  viscosity 
liquid  encapsulates  the  high  viscosity  liquid 
because  it  shields  the  more  viscous  liquid  from 
intense  shearing. 


For  the  very  sticky  and  viscous  oils  the  configuration  which 
appears  to  minimize  dissipation  is  sometimes  achieved  by  a  kind 
of  miracle  which  we  call  rollers.  In  Plate  15.1(a)  we  show  the 
static  configuration  in  which  STP  is  floated  on  water.  The  STP 
wets  the  plexiglass  box  and  water  wets  the  aluminum  rod.  The  rod 
was  put  into  steady  rotation  (clockwise  in  Plates  15.1); 
it  transported  all  of  the  STP  on  the  left  side  of  the  box  (i.e.  the 
upward -motion  side  of  the  rod)  to  the  roller  of  STP,  or  to  a  stag¬ 
nant  region  of  STP  on  the  right  (i.e,  the  downward -motion  side  of  the 
rod) .  A  thin  sheet  of  water  was  pulled  between  the  STP  roller  and 
the  stagnant  STP  (see  Plate  15.1(b)  and  Fig.  15,1),  Astonishingly 
the  roller  of  STP  also  separated  from  the  wall  of  the  box  even 
though  it  is  well  known  to  us  that  STP  is  very  sticky  to  plexiglass. 

This  hydrodynamic ally  generated  separation  of  STP  from  the  plexiglass 
side  wall  is  shown  in  Plate  15.1(c). 

The  STP  roller  rotates  as  a  rigid  wheel,  lubricated  by  water  from 
all  sides.  The  stagnant  STP  on  the  right  barely  moves.  A  sketch 
of  Plate  15.1  is  shown  in  Fig.  15,1  so  that  there  is  no  ambiguity 
about  what  is  being  shown.  This  hydrodynamic  configuration  evidently 
reduces  the  total  dissipation  to  a  very  small  value  associated 
mainly  with  shearing  water  in  a  lubrication  layer. 

Data  for  experiments  leading  to  rollers  is  tabulated  in  Table  15.1. 

The  rollers  are  not  hard  to  obtain.  They  seem  to  arise  out 
of  an  encapsulation  instability  in  which  the  water  spontaneously 
migrates  into  the  regions  in  which  it  undergoes  high  shear,  shielding 
the  STP  from  intense  shearing.  We  have  not  yet  studied  the  rollers 
under  systematic  variations  of  the  parameters.  However,  in  another 
set  of  experiments  we  tried  to  remove  some  of  the  STP  from  the 
stagnant  region.  In  fact  we  removed  all  but  about  a  6mm  layer  of 
this  STP  without  visibly  affecting  the  stability  of  the  roller.  But 
at  a  critical  value  of  the  depth  of  "dead"  STP,  the  roller  bifurcated 
into  another  roller  with  triangular  symmetry.  This  bizarre  triangular 
"figure  of  equilibrium"  was  unstable  but  not  violently  so.  In 
an  attempt  to  save  the  day  we  added  some  STP  to  the  stagnant 
region  and  recovered  stability.  However,  the  newly  stable  roller 
was  lop-sided  and  ugly  so  we  put  a  screwdriver  in  the  box  and  molded 
it,  as  does  a  potter  at  his  wheel,  into  the  automobile  tire  shape  of 
large  radius  which  is  exhibited  in  Plates  15.2(a)  and  15.2(b). 

The  STP  rollers  are  robustly  stable.  They  withstand  large 
perturbations  and  can  rotate  for  weeks  without  apparent  change. 

The  development  of  rollers  might  be  thought  to  be  associated 
to  a  degree  with  normal  stresses  characteristic  of  shear  flows 
of  non-Newtonian  fluids.  We  discount  explanations  of  our  observations 
based  on  normal  stresses  because  the  shear  rates  in  the  rollers 
appear  to  be  small  and  because  we  can  obtain  rollers  in  Newtonian 
liquids . 

The  photographs  exhibited  in  Plates  15.3  show  rollers  of 
silicone  oil  (950  poise)  in  water.  The  viscosity  ratio  is  95,000. 

The  roller  of  silicone  oil  is  almost  perfectly  round,  and  it  has 
detached  from  the  walls  under  hydrodynamic  action.  The  angular 
velocity  of  the  silicone  roller  is  constant;  the  roller  rotates 
as  a  rigid  wheel.  The  roller  is  robustly  stable,  it  rotates  for 
weeks  without  change  of  form.  The  dynamics  of  the  roller  appears  to 
be  governed  by  the  inviscid  equations  of  motion.  There  is  an  unknown 
constant  in  the  pressure  which  determines  the  radius  of  the  roller, 
through  the  balance  of  normal  stress  with  surface  tension.  This  radius 


83 


01 

■H 


U 

W  <JJ 

fQ  rH 
rH 


c 

01  O  T3 
-H  *H  <11 
■M  TJ 
U  'H  vH 
01  01  0) 
rH  C  I 
h  rj  a 
O  M  O 

^  4-1  H 
* 


84 


Table  15.1 

Experiments  in  which  the  high  viscosity  liquid  formed  rollers  lubricated  by  water. 


water 


Fig.  15.1: 


(a)  Sketch  of  plate  15.1(c);  (b) 
plate  15.1(b). 


sketch  of 


pure  silicone 
oil 


^  ^  °  <=>  _  ~ 
^  °0  O  ^ 


woter^^k 


pure  silicone 

.with  water 
droplets 


mo  °  mm 
fJ°o  foil 

% 

5§F;:. 

aqo  .  mmm 

’ig .  15.2:  Fingering  of  water  droplets  into  high  viscosity 
( 9  5  0  0  0  cp )  silicone  oil. 


(b) 


Fig.  15.3:  Schematic  of  the  experiment  with  STP  and  water 
in  the  four-roller  apparatus. 
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is  not  uniquely  determined.  In  fact  the  roller  in  Plates  15.3  has 
captured  all  of  the  silicone  oil  which  was  originally  floated  on  the 
water.  Presumably  if  we  had  floated  more  or  less  oil  we  would 
achieve  a  roller  with  larger  or  smaller  radius.  The  silicone  oil  is 
attracted  energetically  to  the  plexiglass  rod;  it  is  favored  over 
water . 

We  next  describe  what  happened  when  we  tried  to  obtain  rollers 
with  the  same  silicone  oil  (y  =  95,000  cp)  in  the  box  with  an 
aluminum  rod  which  was  wetted  by  water  rather  than  oil.  At  first, 
over  a  period  of  about  six  hours,  we  developed  an  imperfect  roller 
configuration  but  thereafter  we  lost  the  lubricating  water  film 
to  a  cusp-like  water  film  which  is  characteristic  of  fingering 
instabilities  and  the  formation  of  emulsions  (see  Fig.  15.2).  The 
configuration  of  components  then  assumed  form  as  a  dilute  water  laden 
emulsion  of  water  droplets  in  silicone  oil.  The  water  droplets 
were  continuously  generated  from  the  cusp  shown  in  Fig.  15.2. 

The  droplets  were  very  effective  in  reducing  the  viscosity  of  the 
silicone  oil.  More  and  more  of  the  water  droplets  drifted  to  the 
rod.  After  a  few  days  this  collection  of  drops  reached  a  perco¬ 
lation  threshold  with  rings,  like  wedding  bands,  of  water  around  the 
rod.  After  three  days  all  the  rings  had  collected  into  a  sheet 
°f  water  coating  the  rod  and  encapsulating  the  silicone  oil.  The 
silicone  oil  in  this  configuration  appears  not  to  move,  though  of 
course  there  must  be  some  small  motion  of  the  silicone  oil  due  to 
shearing  by  water.  This  final  configuration  is  shown  in  Plate  14,4. 

The  hydrodynamical  history  of  the  silicone  oil  experiments 
exhibits  encapsulation  phenomena  in  the  form  of  roller  instabilities, 
fingering  instabilities  and  the  generation  of  emulsions  and  finally 
to  an  unambiguous  sheet  encapsulation  of  high  viscosity  silicone  oil 
by  a  lubricating  water  layer. 

The  sheet  encapsulation  can  actually  be  put  into  a  more  dramatic 
form.  In  this  configuration  we  get  a  water  layer  on  the  rod,  completely 
surrounded  by  silicone  oil  as  in  the  idealized  picture  shown  in  Fig. 

7.1.  This  was  achieved  by  withdrawing  some  water  with  a  syringe 
after  the  sheet  coating  with  water,  described  in  the  previous  para¬ 
graph,  has  completely  stabilized.  The  fully  encapsulated  configuration, 
shown  in  Plate  14.5,  is  robustly  stable. 

Arrays  of  rollers  of  STP  lubricated  on  all  sides  by  water  can 
be  achieved  using  the  four  roller  apparatus  of  G,  I.  Taylor  sketched 
in  Fig.  15.3.  The  apparatus  is  filled  with  water  up  to  the  plane 
of  symmetry  between  the  upper  and  lower  pairs  of  rollers,  and  then 
STP  is  added  to  cover  the  upper  rollers.  Rollers  of  STP  develop 
out  of  small  sinusoidal  disturbances  of  initially  uniform  (along 
rod  generators)  interfaces.  The  interpenetrating  rollers  which 
finally  develop  are  steady,  stable  and  lubricated  everywhere  by 
water.  Photographs  of  these  rollers  are  exhibited  in  Plates  15.4(a) 
and  (b)  . 

16.  FINGERING  INSTABILITIES,  THE  FORMATION  OF  AND  LUBRICATION 
BY  EMULSIONS."  In  many^cases"  the  wetting  properties  of  the  rod  or 
the  experimental  conditions  do  not  allow  the  formation  of  lubricating 
sheets  of  low  viscosity  liquid.  In  these  cases  we  get  fingering 
of  low  viscosity  liquid  into  high  viscosity  liquid.  Drops  of  low 
viscosity  liquid  are  torn  off  the  finger  tips  leading  to  the  formation 
of  an  emulsion  of  low  viscosity  drops  in  a  high  viscosity  foam. 
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A  type  of  capillary  instability  may  be  associated  with  drop  form¬ 
ation  from  fingers.  The  emulsions  have  a  very  low  viscosity  and 
they  shield  the  bulk  of  high  viscosity  liquid  from  shearing.  In 
Table  16,1  we  have  summarized  the  experiments  exhibiting  fingering 
into  emulsions. 

The  first  group  of  emulsifying  configurations  are  those 
in  which  a  light  liquid  of  moderately  high  viscosity,  like  vegetable 
oil,  light  machine  oil  or  castor  oil,  is  floated  on  top  of  water, 
or  waterbased  polymeric  solutions,  like  polyacrylamide.  In  all  these 
cases  the  oil  wets  the  plexiglass  rod,  and  after  the  whole  rod  is 
exposed  to  oil,  the  oil  clings  tenaciously  to  the  rod  in  a  narrow 
layer,  even  in  a  monolayer,  in  apparent  but  only  superficial 
contradiction  of  the  lubrication  principle.  When  the  rod  rotates 
slowly  there  is  a  tendency  for  water  to  be  drawn  up  onto  the  rod, 
but  surface  tension  pulls  the  water  back  as  shown  in  Plate  16.1(a). 

At  a  higher  speed  the  water  will  begin  to  finger  into  the  oil, 
depositing  droplets  as  shown  in  Plate  16.1(b).  The  fingering  in¬ 
stability  is  sketched  in  Fig.  16.1.  The  continuous  formation  of  drop¬ 
lets  leads  eventually  to  emulsification  of  water  droplets  in  oil 
foam.  The  emulsified  liquid  then  coats  the  rod  as  shown  in  Plate  16.1(c). 
Instead  of  sheet  coating  we  get  coating  by  water-laden  emulsions. 

These  emulsions  have  very  low  viscosities;  first  they  are 
water-laden,  second,  they  tank  tread  like  roller  bearings  and  they 
seem  to  be  nearly  as  effective  as  sheet  coats  in  shielding  the 
high  viscosity  liquids  from  shearing. 

The  photograph  of  castor  oil  above  polyacrylamide  shown  in 
Plate  16.2  is  a  variant  of  fingering  dynamics  leading  to  drops 
and  emulsions.  In  this,  the  polyacrylamide-water  droplets  are 
encapsulated  at  higher  speed  by  a  sheet  of  low  viscosity  (poly¬ 
acrylamide-water)  liquid. 

A  second  group  of  emulsifying  configurations  are  generated  by 
experimental  conditions  which  prevent  the  generation  of  sheet  coating. 

We  did  some  experiments  in  Box  II  of  the  type  that  led  to  the  forma¬ 
tion  of  the  rollers  shown  in  Plates  15.1(a),  (b)  and  (c) .  The  only 

difference  was  that  the  box  was  filled  to  the  top  and  kept  from 
moving  there  by  a  cover  plate.  We  expected  that  the  phase  configura¬ 
tion  of  minimum  dissipation  would  lead  to  the  capture  of  low 
viscosity  fluid  on  the  rod,  with  most  of  it  on  the  rod  if  the 
densities  were  nearly  matched.  The  difference  between  this  sequence 
of  experiments  and  the  ones  in  §15  is  that  the  cover  plate  forces 
a  kind  of  hydrodynamic  lubrication  at  the  top  of  the  cylinder, 
promoting  fingering  and  the  formation  of  emulsions. 

Realizations  of  the  idea  of  the  foregoing  paragraph  are  shown 
in  Plates  16.3(a)  and  (b) .  As  always,  we  started  with  the  static 
configuration  of  heavy  fluid  below  as  shown  in  Plate  15.1(a). 

In  Plate  16.3(a)  we  see  the  phase  configuration  of  STP  (dark)  and 
TLA  227  (light)  after  a  few  days  of  rotation.  Both  fluids  are  oil- 
based  polymeric  immiscible  liquids.  The  density  of  TLA  227  is  about 
0.005  gm/cm3  greater  than  the  density  of  STP.  It  can  be  seen  in  Plate 
16.3(a)  that  after  a  few  days  much  of  the  STP  had  migrated  to  the 
rod,  and  the  streamlines  carrying  in  the  STP  from  remote  regions 
are  evident.  When  the  (dark)  STP  is  drawn  from  remote  regions  to 
the  rod  it  carries  with  it  some  (light)  TLA  227  by  shearing  action. 

After  a  week  the  color  differences  were  very  faint  and  we  were 
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Table  16.1 

Experiments  in  which  the  low  viscosity  fluid  coats  by  fingering  in  emulsions. 


Fig.  16,1:  Fingering  instability  leading  to  emulsions  of 
water  droplets  in  oil  foam. 


unsure  if  the  bulk  of  the  STP  was  really  on  the  rod.  But  when  we 
stopped  the  rod  the  STP  precipitated  out,  as  illustrated  in  Plate 
16.3(b). 

The  shielding  effects  of  emulsions  is  very  clearly  seen  in  the 
experiment  with  STP  (dark)  above  polyacrylamide  in  water  (clear) , 
shown  in  Plates  16.4(a)  and  (b) .  The  polyacrylamide  is  heavy 
and  much  less  viscous  (viscosity  ratio  of  120) .  The  STP  has  a 
great  affinity  for  the  rod  and  when  we  turned  on  the  rod  it  pulled 
along  a  big  annulus  of  the  high  viscosity  STP,  going  against  our 
idea  that  the  low  viscosity  fluid  coats.  But  after  a  while  the 
small  bubbles  of  polyacrylamide  which  were  being  torn  off  were 
injected  into  the  STP  and  the  whole  of  the  STP  emulsified  after 
about  three  days.  The  parts  of  the  emulsion  having  a  heavier  con¬ 
centration  of  low  viscosity  polyacrylamide  drifted  to  the  rod 
and  all  the  shearing  was  confined  to  the  low  viscosity  emulsion 
near  the  rod.  Then  the  polyacrylamide  in  the  emulsion  near  the 
rod  deposited  polyacrylamide  in  almost  pure  form  onto  the  rod. 

This  appears  as  a  light  ring  of  polyacrylamide  on  the  rotating  rod 
shown  in  Plate  16.4(b).  The  bulk  of  the  STP  is  completely  quiescent, 
being  shielded  from  the  polyacrylamide  by  the  emulsion.  We  think 
that  this  configuration  very  nearly  minimizes  dissipation  but 
we  did  not  anticipate  that  the  hydrodynamics  would  take  on  such 
bizarre  forms.  The  rotational  speed  appeared  to  have  increased  a 
lot  (at  the  same  torque)  in  the  week  and  one  half  of  rod  turning, 
suggesting  a  big  drop  in  the  dissipation  of  the  preferred  phase 
configuration . 

17.  CENTRIFUGAL  AND  TAYLOR  TYPE  INSTABILITIES  IN  THE  FLOW  OF 
IMMISCIBLE  LIQUIDS"  The  instabilities  we_ have  in  mind  are  the  ones 
which  are  commonly  associated  with  an  adverse  distribution  of  angular 
momentum.  For  single  fluids  such  instabilities  are  well  under¬ 
stood  in  certain  circumscribed  circumstances;  Taylor  cells  are  the 
best  known  and  most  important  example.  The  point  of  novelty  here  is 
the  presence  of  two  liquids.  Two  types  of  phenomena  which  occur 
in  our  experiments  are  of  interest.  The  encapsulation  instability 
seems  always  to  position  a  low  viscosity  film  or  emulsion  between  the 
rod  and  a  stagnant  body  of  high  viscosity  liquid.  Such  a  configura¬ 
tion  is  very  conducive  to  the  development  of  an  adverse  distribution 
of  angular  momentum.  The  tendency  for  centrifugal  forces  to  throw 
the  heavy,  less  viscous  liquid  outward  seems  to  be  resisted  by  the 
other  more  viscous  and  still  stable  portion  of  the  fluid.  The 
result  is  apparently  a  cellular  motion,  which  here  can  be  observed 
as  a  deformation  of  the  free  surface.  This  is  the  first  type  of  novel 
phenomenon  which  can  be  seen  in  experiments  with  two  liquids.  Such 
free  surface  cells  are  exhibited  in  Plate  17.1. 

A  second  type  of  phenomenon  develops  in  the  flow  of  immiscible 
liquids  in  a  Taylor  apparatus  set  on  its  side.  The  inner  diameter 
of  the  outer  cylinder  is  6.35  cm,  and  the  length  of  the  space  between 
the  inner  and  outer  cylinders  is  30.48  cm.  The  cylinders  are  made  of 
plexiglass,  the  outer  is  stationary  and  the  inner  is  free  to  rotate. 

Three  inner  cylinders  (diameter  =  5,72,  5.08,  3.81  cm)  were  used 
in  the  experiments.  Every  experiment  was  carried  out  with  equal 
volumes  of  the  two  liquids.  The  apparatus  was  half-filled  with 
heavy  liquid  on  the  bottom  and  light  liquid  on  the  top  as  in  Fig.  17.1(a). 

The  fluid  dynamics  of  the  resulting  flow  is  dominated  by  a  form 
of  Taylor  instability  which  seems  to  be  only  weakly  influenced  by 
gravity.  An  idealized  sketch  of  the  cells  which  develop  is  shown 
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in  Fig.  17.1(b).  High  and  low  viscosity  cells  (e.g.,  oil  and  water) 
separate  each  other.  Examples  of  the  cells  which  actually  do  develop 
are  exhibited  in  the  photographs  shown  as  Plates  17.2-17.6, 
Encapsulation  instabilities  enter  into  the  dynamics  of  these  cells  in 
an  important  way  which  we  specify  below. 

It  is  a  good  idea  to  compute  stability  limits  for  centrally 
located  Couette  flows  of  immiscible  liquids  in  two  layers  with 
gravity  neglected.  This  type  of  calculation  would  lead  to  critical 
conditions  for  the  appearance  of  cells.  For  the  present  we  imagine 
that  there  is  a  critical  Taylor  number,  which  is  given  by  analysis 
for  one  fluid  and  is  in  the  form 


f  =  4«2ri  n2  (1-n)4 

C  7  _  2  n4 


where  is  the  radius  of  the  inner  cylinder,  R2  is  the  radius 

Of  the  outer  cylinder,  n  =  R1/R2 ,  v  is  the  kinematic  viscosity 

and  Q  is  the  angular  velocity  of  the  inner  cylinder.  For  small 
1  -  n  =  £ ,  T  =3416  and 

G 


2  p  ^  p  ^ 

34-16  =  ™  R1 


v 


In  the  experiments  shown  in  the  photographs  exhibited  in  this 
paper  the  liquid  with  the  smaller  viscosity  is  water  with  v  =  0.01 
stokes.  We  may  estimate  e  =  1  -  r\  for  the  water  alone  as 


Then,  as  a  rough  measure  of  critical  conditions  we  have 


C:r  =  ^0|i8(- 

100R^  ,L 


R, 


Ri 
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This  is  an  estimate  of  the  critical  speed  in  water.  The  critical 
speed  for  oil  is  higher;  roughly 

20Q  ,  for  silicone  oil  I  (y  =  19cp) , 


12,000ft  for  STP  (u  =  11000  cp) . 
c 

At  low  speed,  we  see  at  first  a  high  torque,  which  very 
rapidly  drops  to  a  lower  value.  Correlating  this  with  direct  „ 
observation,  we  identify  the  first  significant  dynamical  event  as 
an  encapsulation  instability  in  which  a  sheet  of  water  is  pulled 
around  the  inner  cylinder.  After  this  event,  gravity  seems  less 
important  and  the  flows  tend  more  to  axisymmetry. 

The  next  event  is  the  formation  of  Taylor  cells  in  the 
water  layer.  The  dynamics  associated  with  this  event  are  not 
perfectly  understood  in  detail  but  probably  can  be  roughly  described 
as  follows.  Imagine  that  a  layer  of  water  occupies  the  region 
next  to  the  inner  cylinder,  with  Cp]  =  0  or,  equivalently,  g  =  0. 

As  ft  is  increased  past  criticality  the  smooth  flow  of  water  gives 
up  stability  to  Taylor  vortices.  The  oil  also  moves  very  slowly 
in  cells,  driven  not  by  instability  but  by  shear  stresses  induced 
by  cellular  motion  of  the  dynamically  active  water.  These  motions 
will  naturally  distort  the  oil/water  interface,  as  in  Fig.  17.2(a). 
The  large  amplitude  limit  of  the  flow  in  Fig'.  17.2(a)  is  usually 
like  the  flow  depicted  in  Fig.  17.2(c).  In  this  flow  the  passive 
oil  cells  undergo  extremely  weak  cellular  motions  driven  by  shears 
from  active  water  cells.  The  oil  cells  are  rollers  and  are  lubricated 
at  the  sides  and  at  the  outer  cylinder  by  water.  The  lubrication 
of  oil  rollers  by  water  is  enhanced  by  the  fact  that  water  is 
heavier  than  oil  and  will  tend  to  replace  the  oil  layer  on  the 
outside  of  the  cylinder.  If  the  oil  is  sticky  to  plexiglass  a 
small  layer  of  oil  will  continue  to  adhere  to  the  outer  cylinder 
at  positions  above  the  water  layer  where  the  shears  are  small.  To 
understand  this  it  is  necessary  to  study  Fig.  17.3.  In  the 
experiments  it  is  very  easy  to  see  that  the  azimuthal  component  of 
velocity  of  oil  cells  near  the  outer  cylinder  is  much  greater  than 
the  azimuthal  component  of  velocity  in  neighboring  water  cells. 

This  striking  observational  fact  is  completely  explained  by  the 
encapsulation  of  the  oil  rollers  by  water  at  the  outer  wall. 

When  the  viscosity  of  the  oil  is  not  too  great,  it  is 
possible  to  run  our  apparatus  at  speeds  for  which  both  oil  and 
water  are  very  unstable  to  Taylor  vortices .  In  such  situations 
the  two  components  emulsify  strongly,  forming  one  emulsified 
liquid  which  exhibits  Taylor  cells  in  classical  form  (see  Plate  17.3). 
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water  oil  ( b ) 


Fig-  17.2:  Sketch  of  dynamically  active  water  cells  and 

dynamically  passive  oil  cells  (shaded)  arising 
from  the  instability  in  bicomponent  Couette 
flew  between  rotating  cylinders.  The  situation 
in  (a)  arises  near  criticality  as  an  instability 
of  layered  Couette  flow.  The  situation  in  (b) 
could  be  regarded  as  the  large  amplitude  limit 
of  (a)  when  density  differences  are  negligible. 
The  secondary  flow  in  the  oil  cells  is  extremely 
weak.  Instead  of  (b)  we  usually  see  a  confi~ 
guration  like  ( c )  with  passive  oil  cells  rotating 
as  rigid  rollers  attached  to  the  inner  cylinder 
and  lubricated  by  water  at  the  outer  cylinder. 
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(a)  (b) 


Trig.  17.3:  The  azimuthal  velocity  distribution  in  (a) 
water  cells,  and  (b)  oil  rollers  lubricated 
by  water. 
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Plate  3.1:  One  of  the  static  configurations  of  dibutyl 
phthalate  bubbles  (p  =  1.04  gm/cc)  in  glycerine-water 
solution.  The  mixture  of  glycerine  (p  =  1.25  gm/cc)  and  water 
(p  -  1.00  gm/cc)  is  adjusted  to  match  the  density  of 
dibutyl  phthalate. 
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Plate  14.1:  Encapsulation  of  silicone  oil  I  (p  =  0.96  gm/cc, 
\i  -  19  cp)  by  (a)  corn  syrup /water  solution  (p  =  1.2  gm/cc, 
y  -  12  cp,  blue) ,  and  (b)  water  (blue) .  The  speed  of  the 
rod  is  about  150  RPM. 


(a) 


(b) 


Plate  14.2:  Encapsulation  of  glycerine  (p  =  1.25  gm/cc, 
y  =  1760  cp,  clear)  by  silicone  oil  I  (p  =  0.96  g/cc, 
y  -  10  cp,  red) .  The  speed  of  the  rod  is  100  RPM: 

(a)  front  view,  (b)  side  view. 
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(b) 


Plate  14.3;  Encapsulation  of  glycerine  (p  =  1.25  gm/cc, 
y  -  1760  cp,  clear)  by  castor  oil  (p  =  0.96  gin/cc, 
y  =  700  cp,  red) .  The  speed  of  the  rod  is  about  50  RPM: 
(a)  front  view,  (b)  side  view. 
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Plate  14.4:  Encapsulatipn  of  silicone  oil  III  (p  =  0.95  gm/cc, 
y  =  95,000  cp,  red)  by  water  (clear).  The  speed  of  the  rod 
is  about  65  RPM.  The  rod  is  lubricated  entirely  by  water: 

(a)  back  view,  looking  at  an  angle  from  below;  (b)  side 
view. 


Plate  14.5:  The  same  experiment  as  in  Plate  14.4.  After  the 
configuration  in  Plate  14.4  is  achieved,  some  water  is  with¬ 
drawn  from  the  box  so  as  to  lower  the  level  of  silicone  oil 
below  the  rod.  The  rod  is  surrounded  by  a  very  thin  annular 
layer  of  water  and  the  water  is  surrounded  by  oil.  The 
rod  is  completely  lubricated  by  the  water  and  the  oil  is 
sheared  only  by  water. 

It  is  possible  to  obtain  a  water  lubricated  aluminum 
rod  in  which  the  thickness  of  the  water  layer  is  zero, 
confined  to  a  monolayer  on  the  rod.  In  such  configurations 
the  rod  rotates  at  high  speeds,  but  the  silicone  oil  is 
dead  still.  This  violates  the  no-slip  condition?  it  slips, 
completely  as  in  an  inviscid  fluid.  So  if  we  wish  to  say 
that  fluid  will  stick  to  a  solid  we  must  specify  the  fluid, 
the  solid,  and  say  there  are  no  monolayers,  or  make  other 
quantifying  statements. 
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Plate  15.1:  (a)  The  static  configuration  of  STP  (density 

P  =  0.89  gm/cc,  viscosity  p  =  11000  cp,  brown)  on  water 
(y  =  1  cp)  . 

(b)  Front  view  of  the  box.  The  rod  rotates 
clockwise  at  about  40  RPM.  The  STP  on  the  right  is  nearly 
stationary  and  is  shielded  from  shearing  by  a  thin  layer 
of  water. 


(c)  Side  view  of  the  box  looking  in  from  the 
left  of  Plate  15.1(b).  The  STP  roller  is  also  shielded  from 
shearing  against  the  plexiglass  walls  by  a  layer  of  water. 


Plate  15.2:  (a)  The  same  experiment  as  in  Plate  15.1(b) 

and  (c)  after  some  STP  is  removed  from  the  right  of  the 
box,  resulting  in  a  roller  in  shape  of  an  automobile  tire. 
There  is  a  smaller  layer  of  STP  on  top  of  the  water.  This 
layer  is  separated  from  the  roller  by  a  layer  of  water  main 
tained  hydrodynamically . 

(b)  Side  view  of  the  roller,  looking  in  from 
the  left  of  Plate  15.2(a),  The  roller  has  detached  from 
the  side  walls  under  hydrodynamic  action. 
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Plate  15.3:  (a)  The  formation  of  a  roller  of  silicone 

oil  III  (p  -  0.95  gm/cc,  p  =  95,000  cp)  in  water.  In 
the  beginning  the  rod  rotates  counter-clockwise  at  10  RPM. 

The  torque  is  1.4  lb. -in.  Fingering  instabilities  lead  to 
an  emulsion  of  water  droplets  in  silicone  oil;  (1)  front 
view,  (2)  side  view. 

Three  days  later  as  more  water  droplets  are  formed, 
the  effective  viscosity  of  silicone  oil  decreases  and  the  rod 
rotates  at  a  higher  speed  (16  RPM) .  At  this  speed  the  roller 
is  formed  but  does  not  rotate  as  a  solid  body  since  it  is 
still  attached  to  the  side  wall  (3)  front  view.  (4)  side 


Plate  15.3:  (b)  The  speed  of  the  rod  is  increased.  The 

torque  goes  up  to  above  5  lb. -in.  The  two  photographs 
(1)  and  (2)  ,  which  are  taken  5  seconds  apart,  show  the 
dynamics  through  which  the  roller  detached  itself  from  the 
side  wall.  After  detaching  from  the  wall,  the  torque  goes 
down  to  0.6  lb. -in. 
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Plate  15.3;  (c)  After  detaching  from  the  wall  the  roller 

becomes  unstable.  To  restore  stability  the  speed  of  the 
rod  is  reduced  to  12  RPM.  The  roller  is  stable  but  the  shape 
is  irregular;  (1)  front  view,  (2)  side  view.  After  a 
few  hours  the  roller  molded  itself  into  the  shape  depicted 
in  Fig.  15.1.  The  speed  of  the  rod  and  the  speed  on  the 
surface  of  the  roller  are  almost  the  same  showing  that  the 
roller  is  very  nearly  in  a  solid  body  rotation,  with  small 
shearing  by  water  at  the  roller  rim. 
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Plate  15.4:  (a)  Interpenetrating  rollers  of  STP  and  water 

as  seen  from  the  top  of  the  box  sketched  in  Fig.  15.3.  There 
is  water  on  every  side  of  the  STP  rollers.  The  rollers 
develop  as  an  instability  of  two  rollers  (one  on  each  rod) 
which  are  initially  uniform  along  the  rod.  The  water  surface 
between  them  develops  a  small  wave.  A  grown-up  version 
of  small  waviness  can  be  seen  in  the  above  photograph. 

(b)  Interpenetrating  rollers  of  STP  and  water 
as  seen  from  the  side  of  the  box  sketched  in  Fig.  15.3. 

The  clear  parts  are  of  water. 
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Plate  16-1:  (a)  Light  machine  oil  (p  =  0,831  gm/cc, 

y  -  6.36  cp,  yellow)  on  water.  The  rod  rotates  counter¬ 
clockwise  at  115  RPM.  The  contact  angle  in  this 
experiment  seems  to  be  fixed  with  the  contact  line  slipping 
on  the  rod  in  such  a  way  as  to  stay  fixed  in  space.  The 
configuration  of  the  contact  line  and  the  tenacity  of  the 
contact  angle  even  under  pressure  from  the  intense  water 
circulation  under  the  free  surface  are  noteworthy. 

(b)  The  rod  rotates  at  195  RPM.  This  plate 
shows  the  fingering  instability  in  mature  form  and  how  water 
droplets  are  torn  off  the  fingers. 

(c)  Emulsion  of  water  in  light  machine  oil 

at  300  RPM.  The  water-laden  emulsion  shields  the  main  body 
of  oil  (on  the  top  left  and  right)  from  shearing. 


108 


Plate  16*2;  A  combination  of  emulsion  and  sheet  coating  of 
1%  polyacrylamide/water  (y  =  90  cp,  p  =  l.ol  gm/cc,  blue) 
in  castor  oil  (y  =700  cp,  p  =  0.96  gm/cc,  yellow).  The 
rod  rotates  clockwise  at  1  RPM.  The  rod  is  covered  by 
polyacrylamide/water  droplets  and  these  droplets  in  turn 
are  encapsulated  by  a  sheet  of  polyacrylamide/water  shielding 
them  from  shearing  against  the  high-viscosity  castor  oil. 
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(a) 


Plate  16.3:  (a)  Emulsion  of  STP  (y  =  11000  cp, 

p  =  0.890  gm/cc,  dark  brown)  in  TLA  227  (y  =  20000  cp, 
p  =  0.895  gm/cc,  light  brown).  The  rod  rotates  clock¬ 
wise  at  about  16  RPM.  Streamlines  show  STP  being  drawn 
into  the  rod. 

(b)  Emulsion  of  STP  in  TLA  227.  The  motor 
is  stopped.  A  ring  of  STP  (dark  brown)  precipitates  out 
of  TLA  (light  brown)  around  the  rod. 
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Plate  16.4:  (a)  Emulsion  of  1%  polyacrylamide/water 

(U  =  90  cp,p  =  1.01  gm/cc)  in  STP  (u  =  11000  cp, 
p  =  0.890  g/cc,  dark).  The  polyacrylamide/water 
solution  is  on  the  bottom  and  on  the  rod,  which  rotates 
clockwise  at  about  45  RPM. 

(b)  Close-up  view  showing  the  emulsion  and  the 
ring  of  polyacrylamide/water  solution  around  the  rod. 
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Plate  17.1:  Centrifugal  instability  of  multigrade  motor 
oil  (10W40)  in  water.  The  layer  of  oil  on  the  rod  is 
separated  from  the  main  body  of  oil  by  a  sheet  of  water. 
At  the  speed  of  about  300  RPM  the  oil-water  interface 
becomes  wavy. 
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Plate  17.2:  Rollers  of  silicone  oil  I  (p  =  0.95  gm/cc, 
y  =  19  cp,  red)  separating-  Taylor  cells  of  water  (clear) 

(0,  R~  -  R-jJ  =  (130  RPM,  0.635  cm).  The  azimuthal  velocity 
of  silicone  oil  is  much  larger  than  water,  presumably  because 
of  encapsulation  by  a  thin  film  of  water  (see  Fig.  17,3). 


Plate  17.3;  A  similar  experiment  to  Plate  17.2  at  a  much 
higher  angular  velocity,  (£!,  -  R-^)  =  (1810  RPM,  0.318  cm). 

At  this  velocity  the  oil  and  water  are  both  unstable  and  the 
oil  has  completely  emulsified,  forming  a  single  liquid  in 
which  we  see  classical  Taylor  cells. 
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Plate  17,4:  Rollers  of  STP  (orange)  separating  Taylor 
cells  of  water  (clear);  (ft,  R*  -  R^)  =  (86  RPM,  1,27  cm). 

The  viscosity  of  the  STP  is  11,000  that  of  water,  so  that 
STP  is  always  stable  against  Taylor  instability.  The 
azimuthal  velocity  of  the  rollers  is  much  greater  than  that 
of  the  wave  of  STP  which  sticks  to  the  inner  wall  of  the 
outside  cylinder  above  the  water  (see  Fig.  17.2(c)  and  17.3) 
The  manner  in  which  the  STP  is  fractured  to  avoid  being 
sheared  is  noteworhty. 
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Plate  17.5:  Monograde  motor  oil  (SAE40,  brown)  and  water 
(clear),  (ft,  R2  -  R^  =  (120  RPM,  0.318  cm).  At  this 
speed  only  water  is  unstable,  and  the  dynamics  is  similar  to 
(c)  of  Fig.  17.2.  We  see  rollers  of  oil  separating  Taylor 
cells  of  water.  The  azimuthal  velocity  of  water  is  much 
smaller  that  that  of  oil,  apparently  caused  by  the  layers  of 
oil  sticking  to  the  inner  wall  of  the  outer  cylinder. 


Plate  17.6:  Monograde  motor  oil  (SAE40)  and  water 
(ft,  R?  -  Rx)  (440  RPM,  0.318  cm).  The  light  cells 
are  emulsions  of  oil  with  small  droplets  of  water.  The 
light  cells  are  of  water  with  many  large  drops  of  oil. 

This  is  like  a  two  component  flow  of  two  different  emulsions, 
and  the  dynamics  which  are  realized  seem  to  fall  under 
(b)  of  Fig.  17.2. 
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FLOW  PAST  A  FLEXIBLE  MEMBRANE 


Jean-Marc  vanden-Broeck 
Department  of  Mathematics  and 
Mathematics  Research  Center 
University  of  Wisconsin-Madison 
Madison,  WI  53706 


ABSTRACT .  The  deformation  of  a  two-dimensional  membrane  due  to  the  flow 
of  an  incompressible  fluid  around  it  is  considered.  The  membrane  is  assumed 
to  be  flexible  and  inextensible.  The  problem  is  related  to  the  flow  of  air 
past  parachutes.  Numerical  solutions  are  obtained  by  discretization  of  the 
model  nonlinear  integrodiff erential  equation  describing  the  flow.  The 
numerical  results  are  shown  to  be  in  reasonable  agreement  with  experiments. 

In  addition  some  aspects  of  the  effect  of  porosity  are  discussed, 

I .  Introduction .  We  consider  the  deformation  of  a  two-dimensional 
membrane  due  to  the  steady  potential  flow  of  an  incompressible  inviscid  fluid 
around  it  (see  Figure  1).  We  assume  that  the  membrane  is  flexible, 
inextensible  and  characterized  by  a  constant  tension  T.  This  flow 
configuration  is  relevant  to  the  flow  of  air  past  a  parachute.  We  shall  first 
neglect  the  porosity  of  the  cloth.  Some  aspects  of  porosity  will  be  discussed 
in  the  last  section  of  the  paper. 


We  approximate  the  wake  behind  the  membrane  by  assuming  that  the  pressure 
in  it  is  equal  to  a  constant  P^. 

As  we  shall  see,  the  shape  of  the  membrane  is  characterized  by  the 
dimensionless  parameters 


=p 


P  .  P 

b  9 

1  »1 2 

2  PD 


2  PI,2C 


2  POC 


i  -  c 


(1.1) 


(1.2) 


(1,3) 


(1.4) 


Here  Cp  is  the  pressure  coefficient,  the  tension  coefficient,  CD  the 

drag  coefficient,  P^  the  pressure  far  upstream,  p  the  density  of  air,  C 
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the  distance  between  the  edges  of  the  membrane,  U  the  velocity  far  upstream, 
D  the  drag  per  unit  span  of  the  membrane  and  %  the  length  of  the  membrane. 

This  problem  was  previously  considered  by  Newman  and  Low  [1].  They 
presented  experimental  measurements  of  the  drag  coefficient  and  of  the 
pressure  coefficient  for  various  values  of  s.  In  addition  they  performed 
some  numerical  computations  based  on  the  model  of  Parkinson  and  Jandali  [2] . 
Their  numerical  results  were  found  to  be  in  satisfactory  agreement  with  the 
experimental  data. 


In  the  present  paper  we  compute  accurate  solutions  by  using  the  open-wake 
model.  This  model  was  introduced  by  Joukowsky,  rediscovered  by  Roshko  and 
extended  by  Wu  [3] .  Our  numerical  results  are  found  to  be  in  good  agreement 
with  the  experimental  data  of  Newman  and  Low  [ 1 ] . 

The  problem  is  formulated  in  Section  2  and  the  numerical  results  are 
discussed  in  Section  3.  Some  aspects  of  porosity  are  discussed  in  Section  4. 


I I .  Formul ation .  Let  us  consider  the  steady  two-dimensional  potential 
flow  of  an  inviscid  incompressible  fluid  past  a  membrane  (see  Figure  1).  We 
use  the  open-wake  model  to  describe  the  wake  behind  the  membrane.  In  this 
model  the  portions  CD  and  0*0'  of  the  boundary  of  the  wake  are  described 
by  the  free  streamline  theory.  Downstream  of  the  points  D  and  D'  the 
boundary  of  the  wake  consists  of  two  horizontal  straight  lines.  The  position 
of  the  points  D  and  D1  is  to  be  found  as  part  of  the  solution. 


It  is  convenient  to  introduce  dimensionless  variables  by  using  U  as  the 
unit  velocity  and  c  as  the  unit  length.  We  introduce  the  potential  function 
<p  and  the  stream  function  ip.  Without  loss  of  generality  we  choose  <p  -  0 
at  the  point  B  and  ip  “  0  on  the  membrane  and  on  the  surface  of  the  wake . 
The  flow  configuration  in  the  complex  potential  plane  f  =  <p  +  iip  is 
illustrated  in  Figure  2.  We  denote  by  b  and  d  the  values  of  <p  at  C 
and  D.  These  values  will  be  determined  as  part  of  the  solution. 

We  denote  the  complex  velocity  by  u  -  iv  and  we  define  the  function 
T  -  i0  by  the  relation 


T-i0 

u  -  iv  =  e 


(2.1) 


We  shall  seek  T  -  i0  as  an  analytic  function  of  f  =  <p  +  iip  in  the  half 
plane  ip  <  0 . 


The  Bernoulli  equation  yields 


1_ 

2 


ip  <  0  . 


(2.2) 


Here  q  is  the  magnitude  of  the  velocity  and  P  the  pressure.  In 
dimensionless  variables  (2.2)  becomes 
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(2.4) 


On  the  portion  CD  of  the  surface  of  the  cavity  we  require  P  = 
Therefore  (2.3)  yields 

2t 

e  =  1  -  C  b  <  <j>  <  d,  ty  =  0  • 

Here  Cp  is  defined  by  (1.1). 

On  the  membrane  surface  the  Bernoulli  equation  (2.3)  and  the  pressure 
jump  due  to  the  tension  T  yield 

@2T  +  VT  I?  =  1  -  CP'  0  <  ♦  <  »'  *  ■  0  •  t2-5) 

Here  CT  is  defined  by  (1.2). 

On  the  portion  DE  of  the  boundary  of  the  wake  we  impose  the  condition 

0  =  0,  d  <  $  <  ®,  ij>  =  0  .  (2.6) 

Finally,  the  symmetry  of  the  problem  yields 

0*0,  <<j><0,  4,=  0  .  (2.7) 

At  infinity  we  require  the  velocity  to  be  1  in  the  x-direction. 
Therefore,  T  -  iB  must  tend  to  zero  at  infinity.  The  function  x  -  i0  is 
analytic  in  the  half  plane  ^  <  0  and  vanishes  at  infinity.  Therefore  on 
=  0  its  real  part  is  the  Hilbert  transform  of  its  imaginary  part.  Using 
(2.6)  and  (2.7)  the  Hilbert  transform  yields 

T<4>)  =  7  f  77^  •  (2.8) 

*  Q  <P  9 

The  integral  in  (2.8)  is  of  Cauchy  principal  value  form.  The  functions  x($) 
and  9(<j>)  in  (2.8)  denote  the  values  of  T  and  9  on  =  0. 

Relation  (2.1)  yields  the  indentity 


3x 


+ 


-x+i0 

e 


(2,9) 


Taking  the  imaginary  part  of  (2.9)  and  integrating  from  0  to  b  we  obtain 

b 

/  e"Tsin  0  d$  +  \  =  0  .  (2.10) 

0 

Relation  (2.10)  defines  the  unit  length  as  C. 

Finally,  the  length  of  the  membrane  and  the  drag  coefficient  are  given  by 

b 

s  =  =  2  /  e"Td<j>  -  1  ,  (2.11) 

C  0 
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(2.12) 


CD  =  (1  -  Gp)  -  2  /  e2Ty  d*  . 

0  v 

For  given  values  of  Cp  and  s#  relations  (2.4),  (2.5),  (2,8),  (2.10), 
(2.11)  and  (2.12)  define  a  nonlinear  system  of  integrodiff erential  equations 
for  the  unknowns  9(<j>),  t  (<j>),  b,  d,  CD  and  CT*  This  system  was  discretized 
and  the  resultant  algebraic  equations  were  solved  by  Newton's  method.  The 
details  of  the  numerical  procedure  follow  closely  the  work  of  Vanden-Broeck 
[4] .  Therefore  they  will  not  be  repeated  here. 

III.  Discussion  of  the  results.  Roshko  [5]  derived  an  exact  solution 
for  the  open-wake  model  past  a  flat  plate.  In  particular  he  calculated  the 
following  expression  for  CD 

CD  =  f  {W3(1  +  M2)-1  +  W2 <  1  -  wV’[f  -  (1  +  W2)tan“1W]}"1  .  (3.1) 


Here 


w  =  (i  -  cpr1/2  . 

In  order  to  check  the  accuracy  of  our  numerical  procedure,  we  run  our 
program  with  Cp  =  -1.34  and  s  =  0  for  20,  40,  100  and  150  mesh  points. 
The  respective  values  of  CD  were  2.104,  2.095,  2.092  and  2.091.  This 
sequence  of  values  is  in  good  agreement  with  the  exact  value  2.091  predicted 
by  (3.1).  This  constitutes  an  important  check  on  the  validity  of  our 
numerical  approach. 

The  experimental  results  of  Newman  and  Low  [1]  suggest  that  Cp  is  a 
linear  function  of  s  for  0  <  s  <  0.6.  We  found  that  the  experimental 
values  presented  in  Figure  13  of  their  paper  could  be  fitted  by  the  straight 
line 


Cp  =  -1.3  -  0.6s  .  (3.2) 

The  numerical  scheme  was  run  for  various  values  of  s  and  Cp 
satisfying  (3.2).  Some  of  the  results  are  presented  in  Table  1. 


s 

CP 

CD 

CT 

0.02 

-1.31 

2. 16 

3.34 

0.1 

-1.36 

2,28 

1.71 

0.2 

-1.42 

2.38 

1.41 

0.3 

-1.48 

2.45 

|  1.31 

0.4 

-1.54 

2.52 

1.29 

0,6 

-1.66 

2.66 

1.33 

Table  1 
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The  values  of  CD  and  CT  given  in  Table  1  agree  within  ten  percent 
with  the  experimental  data  presented  by  Newman  and  Low  in  Figures  8  and  18  of 
their  paper.  Therefore  the  open  wake  is  adequate  for  the  prediction  of  Cn 
and  CT. 

IV.  The  effect  of  porosity.  Cumberbatch  [6]  extended  the  open-wake 
model  to  the  flow  past  a  porous  flat  plate.  He  assumed  a  Darcian  flow  through 
the  mesh  and  obtained  a  solution  in  closed  form.  In  addition  he  showed  that 
his  solution  is  in  fair  agreement  with  the  experimental  data  of  Taylor  and 
Davies  [7]. 

Numerical  solutions  for  the  flow  past  a  flexible  and  porous  membrane  can 
be  obtained  by  an  appropriate  generalization  of  the  procedure  described  in 
Section  2.  When  porosity  is  included  the  stream  function  ip  is  no  longer  a 
constant  along  the  membrane.  The  basic  idea  is  to  write  ip  =  F(<p)  along  the 
membrane  and  to  determine  F(<p)  by  imposing  Darcy's  law  across  the  mesh. 
Preliminary  computations  with  s  =  0,  gave  results  in  agreement  with 
Cumberbatch' s  [6]  solution.  Work  is  continuing  on  this  aspect  of  the  problem. 
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VEHICLE  DYNAMIC  ANALYSIS  WITH  FLEXIBLE  COMPONENTS* 
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ABSTRACT .  A  method  is  presented  for  nonlinear,  transient  dynamic 
analysis  of  vehicle  systems  that  are  composed  of  Interconnected  rigid  and 
flexible  bodies.  The  finite  element  method  is  used  to  characterize 
deformation  of  each  elastic  body  and  a  component  mode  technique  is  employed  to 
reduce  the  number  of  elastic  generalized  coordinates.  Equations  of  motion  and 
constraints  of  the  coupled  system  are  formulated  in  terms  of  a  minimal  set  of 
modal  and  reference  generalized  coordinates.  A  Lagrange  multiplier  technique 
is  used  to  account  for  kinematic  constraints  between  bodies  and  a  generalized 
coordinate  partitioning  technique  is  employed  to  eliminate  dependent 
coordinates.  The  method  Is  applied  to  a  planar  truck  model  with  a  flexible 
chassis  and  nonlinear  suspension  components.  Simulation  results  for  transient 
dynamic  response  as  the  vehicle  traverses  a  bump,  including  the  effect  of 
bump-stops,  and  random  terrain  show  that  flexibility  of  the  chassis  can  be 
routinely  accounted  for  and  predicts  significant  effects  on  vibratory  motion 
of  the  vehicle.  Compared  with  a  rigid  body  model,  flexibility  of  the  chassis 
increases  peak  acceleration  of  the  chassis  and  induces  high  frequency  vertical 
acceleration  in  the  range  of  human  resonance ,  which  deteriorates  ride  quality 
of  off-road  vehicles. 


1.  INTRODUCTION.  Modern,  lightweight,  off-road  vehicle  systems, 
operating  over  rough  terrain,  have  placed  increasingly  higher  demands  on  the 
technology  required  to  accurately  model  and  predict  dynamic  response  of  a 
vehicle  system.  In  order  to  predict  dynamic  performance  of  a  vehicle,  it  is 
necessary  to  consider  nonlinear  suspension  kinematics  and  forces,  coupled  with 
elastic  deformation  of  the  vehicle  chassis.  Accurate  description  of  vehicle 
dynamics;  e.g.,  ride  comfort  and  precision  of  armament  subsystems,  requires  a 
high  resolution  mathematical  model  that  accounts  for  flexibility  effects  and 
their  coupling  with  geometrical  and  suspension  force  nonlinearities.  This  is 
mainly  due  to  the  large  number  of  degrees-of-f reedom  required  to  model  vehicle 
components  and  the  high  degree  of  geometrical  nonlinearity  associated  with 
gross  motion  of  suspension  components  and  force-displacement  nonlinearity 
associated  with  suspension  bump— stops.  When  flexibility  is  considered,  the 
problem  becomes  even  more  difficult,  because  of  the  increasing  dimensionality 
and  high  frequencies  of  natural  vibration. 

Some  investigators  [1-2]  have  *  considered  flexibility  of  vehicle 
components.  Their  method  of  analysis  is  based  on  a  linear  theory  that  has 
been  employed  to  analyze  mechanisms  with  flexible  members  [3-5].  In  this 


^Research  Supported  by  Project  No.  DAAG29-82-K-0086  U.S.  Army  Research  Office. 


125 


analysis,  elastic  deformation  is  assumed  to  have  no  significant  effect  on 
gross  motion.  Gross  motion  is  first  determined,  using  rigid  body  analysis  and 
the  resulting  inertia  and  reaction  forces  are  introduced  in  elastic  analysis 
of  the  components.  The  total  motion  of  the  elastic  member  is  then  obtained  by 
superposition  of  small  deformation  on  gross  body  motion.  There  is,  however, 
an  increasing  demand  to  produce  lighter  weight  vehicular  components  that 
operate  at  higher  speeds.  Linear  theory  assumptions  are  no  longer  accurate 
enough  to  represent  system  dynamics,  since  flexibility  effects  can 
significantly  affect  motion  at  the  driver’s  station. 

Sunada  and  Dubowsky  [6]  recently  presented  a  method  for  the  dynamic 
analysis  of  flexible  mechanisms  that  couples  flexible  degrees  of  freedom  with 
a  geometrically  nonlinear  set  of  equations  of  motion.  Existing  finite  element 
structural  programs  are  combined  with  a  4x4  matrix  dynamic  analysis 
technique.  The  method  has  been  applied  to  analyze  spatial  mechanisms  and 
robotic  manipulators.  The  capability  of  this  method  to  treat  applications 
such  as  vehicle  systems  and  space  structures  without  substantial  ad-hoc 
formulation,  is  not  clear.  Further,  this  method  neglects  rotary  inertia  of 
mass  that  is  lumped  at  individual  grid  points ,  in  order  to  avoid  the 
difficulty  of  using  a  consistent  mass  approach  to  represent  inertia  coupling 
between  the  rigid  body  motion  and  the  elastic  deformation. 

Shabana  and  Wehage  [7-8]  presented  a  method  for  dynamic  analysis  of  large 
scale  inertia-variant  flexible  systems  with  coupled  reference  and  elastic 
deformation.  Each  flexible  body  is  represented  by  two  sets  of  generalized 
coordinates.  The  first  set  defines  the  location  and  orientation  of  a  body- 
fixed  coordinate  system  that  is  rigidly  attached  to  a  point  on  the  flexible 
body.  Second,  elastic  generalized  coordinates  characterize  small  deformation 
relative  to  the  body-fixed  system.  This  set  of  coordinates  is  introduced 
using  the  finite  element  method  of  structural  analysis.  Modal  analysis  is 
employed  to  reduce  the  number  of  elastic  degrees  of  freedom,  hence  reducing 
problem  dimensionality  to  manageable  extent. 

The  purpose  of  this  paper  is  to  adapt  the  automated  analysis  method  of 
Refs  7-8  for  coupled  dynamic  analysis  of  planar  vehicle  systems  that  are 
composed  of  rigid  and  flexible  bodies.  As  a  numerical  example,  a  cross 
country  truck  is  considered  in  which  the  chassis  is  flexible.  This 
investigation  is  mainly  concerned  with  analysis  of  the  effect  of  chassis 
flexibility  on  dynamic  response  of  the  vehicle,  over  a  single  bump  and  random 
terrain.  A  rather  simplified  tire  model  is  used  in  this  study.  Extension  of 
the  formulation  presented  and  illustrated  here  to  include  a  more  realistic 
tire  model  and  three  dimensional  structural  vibration  [9]  is  theoretically 
simple  but  involves  more  detailed  calculations  than  are  presented  here. 

To  achieve  the  above  goals,  the  DADS  computer  program  is  used  to 
automatically  generate  equations  of  motion,  using  a  Lagrangian  formulation. 

The  system  equations  of  motion  are  solved  numerically  using  a  direct 
integration  method  and  advantage  is  taken  of  sparsity  of  the  matrices  arising 
in  the  formulation.  As  is  shown,  this  automated  formulation  is  general  and 
conserves  manpower  that  would  be  required  in  ad-hoc  model  formulation  and 
analysis. 
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2*  VEHICLE  AND  ROAD  SURFACE  MODELS 


2*1  Vehicle  Model,  The  vehicle  used  in  this  investigation  is  a  5  ton, 
cross  country  truck  [10]  with  rigid  axles  and  a  Watt  mechanism  independent 
suspension*  Figure  1  shows  the  general  configuration  of  the  truck.  Figure  2 
illustrates  the  wheel  suspension,  with  two  suspension  support  arms  beneath  the 
axle  and  two  arms  above.  Vertical  forces  are  supported  by  coil  springs  of 
progressive  stiffness.  Figure  3  is  an  overall  view  of  the  frame,  which 
consists  of  two  sidemembers  (closed  box  grlders)  and  several  tubular 
crossmembers  that  are  mounted  in  holes  in  the  sidemembers  and  welded  to  their 
inner  and  outer  sides.  Vehicle  parameters  used  In  this  analysis  are  given  in 
Table  1. 


Table  1  Vehicle  Parameters 


Parameter 


Value 


Gross  Vehicle  Mass 
(including  payload) 

Vehicle  Sprung  Mass 

Vehicle  Unsprung  Mass 

Front  axle 

Rear  axle 

Long  trailing  arms 
Short  trailing  arms 

Pitch  moment  of  inertia  of  sprung  mass 
(about  C*M.) 

Front  and  Rear  Suspension 

Spring  rate  (per  spring) 

Damping  rate  (per  shock  absorber) 

Compression 

Rebound 

Wheel  Travel  (unloaded) 

Jounce 

Tire  Quadratic  Spring  Constant  (per  tire) 

Damping  rate  (per  tire) 

Tire  Radius 


14,400 

kg 

11,950 

kg 

2,450 

kg 

1185.0 

kg 

1185.0 

kg 

44.8 

kg 

35.2 

kg 

58300,0 

kg-m 

6*91x10"*  N/m 

5480*0  N, sec/m 
17575,0  N. sec/m 

0.15  m 

5.649xl07  N/m2 
4625.0  N,sec/m 
0.6  m 


Vertical  Natural  Frequency  of  Sprung  Mass 
Sprung  Pitch  Natural  Frequency 


1.98  Hz 
1.95  Hz 
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g.  1  5  Ton  4x4  Cross  Country  Truck 


Fig.  2  Wheel  Suspension 


Fig.  3  Main  Frame 


A  simplified,  planar  rigid  body  truck  model  is  shown  in  Fig.  4.  Bodies  1 
and  2  are  the  front  and  rear  axle-wheel  assemblies,  respectively.  Each  axle- 
wheel  mass  is  assumed  to  be  concentrated  at  the  wheel  center.  Body  3  is  the 
chassis  of  the  truck.  The  mass  of  the  chassis  includes  masses  of  the  payload 
and  engine.  Body-fixed  coordinate  axes  are  located  at  the  centroid  of  each 
body.  Bodies  4,  5,  6  and  7  are  the  trailing  arms  that  connect  the  chassis  and 
wheel  assemblies  by  revolute  joints.  The  function  of  these  trailing  arms  is 
to  provide  kinematic  control  of  the  axle  position  and  to  absorb  driving  and 
braking  torques  acting  on  the  wheels.  Therefore,  they  are  modeled  as  a  Watt’s 
mechanism,  which  gives  very  small  rotation  to  the  axle  during  vertical 
displacement.  Rigid  body  data  and  the  initial  location  and  orientation  of  the 
body-fixed  coordinate  systems  of  each  body,  with  respect  to  the  inertial 
reference  frame,  are  given  in  Table  2, 

The  suspension  springs  and  dampers  and  the  tires  are  modeled  by  springs 
and  dampers,  as  shown  in  Fig.  5.  Spring  characteristics  of  the  tires  are 
taken  here  as  quadratic  functions  of  displacement,  A  simple  point  contact 
tire  model  is  used  to  simulate  tire  forces  that  occur  due  to  motion  of  the 
wheel  relative  to  the  road  surface.  Fore-and-aft  force  components  are 
neglected,  assuming  the  tire  force  is  always  vertical.  The  tire  is  free  to 
leave  the  ground,  to  simulate  wheel  hop.  Nonlinear  spring  and  damping 
characteristics  of  suspension  elements  are  given  in  Fig.  6.  The  high 
stiffness  of  the  suspension  spring  in  compression,  when  the  spring  deflection 
is  greater  than  0.15  m,  simulates  the  bump-stop  in  the  suspension  system. 

A  rigid  body  vehicle  model  of  this  vehicle,  with  five  degrees-of-f reedom 
(5-DOF),  is  also  formulated  to  allow  evaluation  of  the  effects  of  chassis 
flexibility. 


Table  2  Rigid  Body  Data  and  Initial  Positions 


Body  No. 

Mass  (kg) 

Moment  of  Inertia 
about  C.G,  (Kg-m2 ) 

Initial  Body  Coordinates 
X(m)  y(m)  <j>(rad) 

1 

1185 

13.33 

1,500 

0.575 

0.0 

2 

1185 

13.33 

5.850 

0.575 

0.0 

3 

11950 

58300 

3.675 

0.975 

0.0 

4 

22.4 

2.38 

2.116 

0.725 

0.464 

5 

22.4 

2.38 

5.234 

0.725 

-0.464 

6 

17.6 

1.14 

1.066 

0.909 

-0.154 

7 

17.6 

1.14 

6,284 

0,909 

0.154 

2.2  Road  Surface  Models.  The  dynamic  response  of  a  vehicle  depends 
strongly  on  the  vertical  displacement  history  of  the  wheels  on  the  road 
surface.  In  this  investigation,  two  roadway  models  are  used,  as  shown  in 
Fig.  7,  Figure  7(a)  represents  a  simulated  obstacle  with  0.2m  height  and  0.4m 
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(cm  ) 


width,  which  is  used  to  simulate  shock  response  of  the  vehicle  over  a  single 
bump.  Figure  7(b)  is  a  terrain  profile  with  a  RMS  roughness  of  2,64cm  (1.04 
in)  and  a  length  of  91.44m  (300  ft). 

3,  ANALYTICAL  APPROACH.  The  analysis  method  employed  in  this 
investigation  is  similar  to  the  method  used  in  Refs,  7-8  to  analyze  mechanical 
systems  with  interconnected  rigid  and  flexible  bodies.  In  this  method,  the 
chassis  of  the  vehicle  is  considered  as  a  deformable  substructure.  Two  sets 
of  generalized  coordinates  are  employed  to  describe  the  flexible  body 
configuration.  First,  reference  generalized  coordinates  define  the  location 
and  orientation  of  a  body-fixed  coordinate  system  on  each  body.  Second,  a  set 
of  elastic  coordinates  define  small  deformation  of  each  body,  relative  to  its 
body-fixed  coordinate  system.  This  set  is  introduced  using  the  finite  element 
method. 


Kinetic  and  strain  energy  expressions  are  developed  for  the  individual 
elements.  The  kinetic  and  strain  energy  of  each  body  are  obtained  by  summing 
energies  of  its  elements.  Constraints  between  different  elements  of  a  body 
are  expressed  in  a  Boolean  form  and  constraints  between  different  bodies  are 
introduced  using  a  Lagrange  multiplier  technique.  The  generalized  coordinate 
partitioning  method  [11]  and  a  component  mode  structural  analysis  technique 
are  employed  to  describe  the  system  equations  of  motion,  with  a  minimal  set  of 
independent  generalized  coordinates  [7,8].  The  method  of  Refs.  7  and  8  is 
summarized  here,  for  completeness. 


3.1  Energy  Expressions.  Figure  8  shows  a  typical  element  j  of  a  two 
dimensional  planar  flexible  body  i.  Let  the  x-y  coordinate  system  represent 

an  inertial  reference  frame  and  the  x^y1  system  represent  a  coordinate  system 
that  is  rigidly  attached  to  body  i. 

The  location  of  an  arbitrary  infinitesimal  volume  at  point  on  element 
j  can  be  defined  as 


R1^  =  R1  + 

-v  -  “ 


(1) 


i  x  i  T 

where  R  =  [x  ,y  ]  is  the  vector  of  translational 
of  the  coordinate  system  of  body  1  with  respect  to 


coordinates  of  the  origin 
the  x-y  system, 


"cosO^^  -sin0i  ' 
sin0^  cos0i 


(2) 


is  the  transformation  matrix  from  x^-y*  to 
the  position  vector  of  p*^  with  respect  to 

JLij  -  +  ^ 


x-y  coordinate  systems,  and  is 
the  x^y*  system,  defined  as 

(3) 
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where  is  the  position  vector  of  in  the  undeformed  state  and  is  the 
elastic  displacement  vector  in  the  body  fixed  coordinate  system.  Let  an 
xij-yij  coordinate  system  be  attached  to  the  left  end  of  element  j.  Using  a 
shape  function,  can  be  expressed  in  terms  of  nodal  coordinates 

(k  =  1,2... 6),  which  represent  nodal  coordinates  and  slopes  of  reference 
lines  at  nodes,  relative  to  the  x^-y*  system, 


r1*  -  N^eij 


(4) 


where  is  the  element  shape  function. 

From  Eq.  1,  the  position  vector  can  be  expressed  [7-8],  in  terms  of 
reference  coordinates  (x1 ,  61 )  and  nodal  coordinates  (e^ ) ,  as 

=  R1  4-  (5) 

Differentiating  Eq,  5  with  respect  to  time  gives 

&  a'  +  W  +  AiNi^  (6) 

where 

A" 


^i  _  gi  -sine*  -cos0i 

_  cos8*  -sine1. 


0*A* 


(7) 


Substituting  Eq.  7  into  Eq.  6  and  writing  the  result  in  partitioned  form 
yields 


= 

“P 


I  A1  VV3  ]  {  # 


(8) 


The  kinetic  energy  expression  for  element  ij  is  given  by 


rij  1 


=  4/ 


plj  Rlj  Rlj  dViJ 


2  J  TTi  j  -p  -p 


1  •  ij  wi j  *ij 
“  M  1 


(9) 
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where  V1^  is  the  element  volume,  is  density  of  the  element  material, 

T  T  T 

=  [  R1  0±  ]  (10) 

are  the  generalized  coordinates  of  element  ij  and  MiJ  is  the  element  mass 
matrix  [7-8],  The  vector  e1^  can  be  written  as 


ij 


ij  ij 

V  +  V 


(ID 


where  is  the  vector  of  nodal  coordinates  in  the  undeformed  state  and  q^ 

is  the  vector  of  deformations  at  the  nodes,  defined  with  respect  to  the  body- 
fixed  coordinate  system# 

The  total  kinetic  energy  of  body  i  is  given  by 

i 

T1  =  Z  T1^ 

j-I 

T 


-  1  A1 

2  -3- 


M*  q1 


(12) 


T  T  T  T 

where  q1  =  [  R1  91  %  ]  =  [  4  % 


,  and  q^  represent. 


respectively,  reference  and  elastic  coordinates  of  body  i#  The  strain  energy 
of  body  i  can  also  be  expressed  in  compact  form  as  [7-8] 


i  1  iT  „i  1 

o  =Tl  K  1 


(13) 


where  K1  is  the  stiffness  matrix  of  body  i. 

The  virtual  work  of  external  forces  acting  on  body  i  can  be  written  as 

(14) 


.T 

5W1  =  Q1  6q± 


where  Q1  is  the  vector  of  generalized  forces  associated  with  the  generalized 
coordinates  of  body  i# 


3,2  Equations  of  Constraint,  When  adjacent  bodies  are  connected, 
nonlinear  constraint  equations  are  written  between  adjacent  bodies  and  a 
Lagrange  multiplier  method  is  employed  to  adjoin  these  constraint  equations  to 
the  equations  of  motion*  These  constraints  permit  the  joining  of  elastic 
bodies,  rigid  bodies,  or  rigid  and  elastic  bodies.  Points  of  attachment  on 
elastic  bodies  are  at  nodes  of  the  finite  element  model.  In  general, 
equations  of  constraint  can  be  written,  in  vector  function  form,  as 
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£(q_,t)  -  o 


(15) 


T 

where  $(q,t)  *  [  L(q , t )  , -  - .  »t ) ]  .  This  is  a  set  of  nonlinear  algebraic 

equations,  which  can  be  used  to  describe  constraints  between  vehicle 
components. 

3.3  Equations  of  Motion.  The  composite  vector  of  all  system  generalized 

___  t  T 

IT  2T  N 

coordinates  is  designated  as£=[_q  ,  q.  .••••€[  ],  where  N  is  the  total 

number  of  bodies  (substructures)  in  the  system.  The  constraint  equations  of 
Eq,  15  are  assumed  to  be  independent.  Presuming  that  the  constraints  are 
workless,  the  variational  form  of  the  equations  of  motion  [12]  for  body  i, 
where  subscript  notation  denotes  differentiation  with  respect  to  a  vector,  is 


d 

dt 


A  iT  i 
+  IT  -  Q  65  =  0 

q1 


(16) 


for  all  virtual  displacements  <$q_  that  are  consistent  with  constraints  of  Eq. 

“  T 

15.  It  can  be  shown  that  introducing  the  vector  X_  into  Eq.  16  allows  the 

coefficients  of  6^*  to  be  set  to  zero  [13].  Thus, 


T1 

dt  •  i 

q 


-  T 


i  T 

-  £ +  ±  .X 


=  0 


(17) 


_  a  1  1 

with  T^,  U^,  and  given  by  Eqs,  12,  13,  and  14. 

3.4  Generalized  Coordinate  Reduction.  Efficient  solution  of  the  system 
equations  of  motion  requires  a  transformation  from  the  space  of  system  nodal 
generalized  coordinates  to  the  space  of  system  modal  generalized  coordinates, 
which  has  lower  dimension.  The  method  presented  in  Refs  7-8  is  based  on 
solving  the  eigenvalue  problem  for  each  substructure  once.  From  Fourier 
analysis  of  the  forcing  functions,  an  initial  estimate  of  the  number  of  modes 
to  be  retained  is  made.  During  the  simulation,  additional  eigenvectors  are 
recalled  or  deleted,  as  required.  For  the  purpose  of  determining  eigenvalues 
and  eigenvectors,  if  a  substructure  is  assumed  to  vibrate  freely  about  a 
reference  configuration,  Eq,  17  yields 


4  4 +  4  4 


(18) 


Where  M*f  and  K*f 


are  the  mass  and  stiffness  matrices  associated  with  the 
-i 


nodal  generalized  coordinates  and  is  the  vector  of  elastic  coordinates 
after  imposing  the  body-fixed  coordinate  conditions.  The  stiffness 
matrix  is  positive  definite,  because  the  reference  coordinate  system  is 
fixed.  Equation  18  yields  a  set  of  eigenvectors  and  a  modal  matrix.  A 
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coordinate  transformation  from  the  physical  nodal  coordinates  to  modal 
coordinates  is  defined  by 

if  =  x1 

where  B*  is  the  mocial  matrix,  consisting  of  the  eigenvectors  obtained  from  Eq. 

18  and  x  is  a  vector  containing  the  modal  coordinates.  Using  Eq.  19,  the 
reference  and  nodal  generalized  coordinates  are  written  in  terms  of  reference 
and  modal  coordinates.  A  substantial  reduction  in  problem  dimensionality  can 
be  achieved  by  considering  only  significant  modes. 

4.  NUMERICAL  RESULTS.  In  order  to  take  the  effect  of  flexibility  of 
vehicle  components  on  global  vehicle  motion  into  account,  the  chassis  (Body  3) 
and  long  trailing  arm  (Body  4)  are  modeled  as  elastic  bodies.  The  chassis  and 
long  trailing  arm  are  divided  into  12  and  2  finite  beam  elements , 
respectively,  with  reference  coordinates  located  at  their  midpoints.  The 
flexural  rigidity  of  each  flexible  member  is  calculated  using  the  cross- 
sectional  area  of  the  beam  and  its  material  properties.  Lateral  and  axial 
deformation  are  considered. 

The  flexible  components  are  initially  treated  as  substructures  that  are 
fixed  at  their  midpoints.  Since  each  beam  element  has  6  degrees-of-f reedom, 
the  flexible  chassis  has  a  total  of  36  elastic  degrees-of-freedom  and  each 
flexible  link  has  6  elastic  degrees-of-freedom.  The  eigenvalue  problem  is 
solved  for  each  of  these  substructures.  The  lowest  six  natural  frequencies  of 
the  flexible  chassis  are  6.11,  6.11,  38.31,  38.31,  83.36,  and  83.36  Hz,  where 
the  first  four  modes  are  bending  modes  and  the  fifth  and  sixth  modes  are  axial 
vibration  modes.  The  lowest  two  natural  frequencies  of  the  flexible  links  are 
88.92  and  88.92  Hz.  One  percent  structural  damping  is  considered  for  every 
chassis  mode  of  vibration, 

4.1  Vehicle  Response  over  Single  Bump.  The  vehicle  travels  over  the 
single  bump  given  in  Fig,  7(a),  with  a  vehicle  speed  of  3  m/sec  (6.7 
miles/hr) *  The  simulation  is  carried  out  for  two  vehicle  models,  rigid  and 
flexible  chassis,  to  evaluate  flexibility  effects  of  the  chassis  on  vehicle 
motion.  To  compare  higher  mode  effects  of  the  flexible  chassis  model,  2-  and 
4-mode  solutions  are  obtained.  Figure  9  shows  the  vertical  displacement  of 
the  center  of  mass  of  the  chassis,  from  its  static  equilibrium  position.  The 
figure  shows  significant  peak  differences  between  vertical  displacement  of 
rigid  and  flexible  chassis  models.  Figure  9  also  shows  that  the  contribution 
of  higher  vibration  modes  to  vertical  displacement  of  the  chassis  is 
negligible. 

The  vertical  acceleration  at  the  center  of  mass  of  the  chassis  is  given 
in  Fig.  10,  for  each  model.  It  shows  that  chassis  flexibility  results  in 
increased  peak  acceleration  and  significantly  higher  frequency  content  during 
passage  over  the  bump.  The  effect  of  structural  damping  on  the  vehicle 
response  is  shown  in  Fig,  11.  In  this  figure  the  vertical  acceleration  of  the 
chassis  with  and  without  damping  are  plotted.  The  damped  response  decays  with 
time,  while  the  undamped  response  has  a  sustained  oscillation. 
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VERT.  DISP.  OF  CHASSIS  C.G 


Fig.  11  Effect  of  Structural  Damping  on  Vertical  Acceleration  of  Chassis 
C.G.  over  Single  Bump,  Flex,  Chassis  Model,  2  Modes 


Rigid  Body  Nodal 

Flex.  Chassis  Model,  4  Modes 


Computation  time  for  the  rigid  model  simulation  was  7.0  minutes  on  a 
PRIME  750  supermini  computer.  Computation  times  with  the  two  and  four  mode 
flexible  chassis  models  were  2.08  and  5.18  times  the  computation  time  of  the 
rigid  model.  This  shows  that  computational  efficency  can  be  obtained  by  using 
the  smallest  number  of  modes  required  to  obtain  reasonable  accuracy. 

4,2  Vehicle  Response  over  Random  Terrain.  Simulation  is  carried  out 
over  the  terrain  given  in  Fig.  7(b)  for  the  rigid  model,  the  two  and  four  mode 
flexible  chassis  models,  the  two  mode  flexible  links  model,  and  the  two  mode 
flexible  chassis  and  links  model.  Vehicle  velocity  is  7.5  m/sec.  (16.8 
miles/h).  Results  are  given  in  Figs,  12  to  14,  Since  link  flexibility  does 
not  have  significant  effect  on  the  global  vehicle  motion,  results  for  the 
model  with  flexible  links  are  not  included. 

Figure  12  shows  vertical  displacement  at  the  center  of  mass  of  the 
chassis.  No  significant  difference  is  observed  between  rigid  and  flexible 
models.  No  significant  difference  has  been  found  between  the  two  and  four¬ 
mode  solution  of  the  flexible  chassis  models.  It  has  also  been  found  that 
there  is  no  significant  difference  in  vertical  chassis  displacement  between 
the  two  mode  flexible  chassis  and  links  model  and  the  two  mode  flexible 
chassis  model.  It  is  concluded  that  flexibility  effects  of  the  stiff  link 
(which  has  relatively  high  natural  frequency)  on  the  vehicle  response  may  be 
neglected. 

Figure  13  shows  vertical  acceleration  at  the  center  of  mass  of  the 
chassis.  Flexibility  of  the  chassis  results  in  a  significant  increase  in  the 
acceleration  level  at  the  center  of  the  chassis  and  high  frequency  content 
near  the  resonant  frequency  of  the  human  body.  Vibration  in  the  chassis  may 
thus  result  in  an  unpleasant  motion  and  deteriorate  ride  comfort  of  the 
vehicle.  Suspension  link  flexibility  does  not  have  noticeable  effects  on  the 
vertical  acceleration  of  chassis. 

Deflection  of  the  front  end  of  the  chassis,  with  respect  to  its  body- 
fixed  coordinate  system,  is  given  in  Fig.  14.  Dynamic  peak  deflections  for 
the  two  mode  flexible  chassis  model  is  22  times  the  static  deflection  of  that 
model.  The  frequency  of  vibration  of  the  front  of  the  chassis  is  about  6  Hz, 
which  is  the  fundamental  natural  frequency  of  the  chassis. 
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Rigid  Body  Model 

Flex.  Chassis  Model.  2  Modes 


Fig.  13  Vertical  Acceleration  of  Chassis  C.G.  over  Random  Terrain, 

2  Modes 


References 


1*  Vail,  C.F.,  "A  Modal  Synthesis  Technique  for  Determining  Properties  for 
Structures  for  Mass  and  Stiffness  Changes,"  SAE  paper  740329,  1974* 

2-  Elmadany,  M.M. ,  Dokanish,  M.A. ,  and  Allan,  A.B.,  "Ride  Dynamics  of 

Articulated  Vehicles  -  A  Literatures  Survey,"  Vehicle  Systems  Dynamics, 
Vol.  8,  1979,  pp. 289-316. 

3,  Winfrey,  R.C.,  "Elastic  Link  Mechanism  Dynamics,"  ASME  Journal  of 
Engineering  for  Industry,  Feb.  1971,  pp.  268-272. 

4*  Sadler,  J.P.,  and  Sandor,  G.N. ,  "A  Lumped  Parameter  Approach  to 

Vibration  and  Stress  Analysis  of  Elastic  Linkages,"  ASME  Journal  of 
Engineering  for  Industry,  May  1973,  pp.  549-557. 

5.  Erdman,  A.C.,  Sandor,  G.N.,  and  Oakberg,  R.G.,  "A  General  Method  for 
Kineto-Elastodynamlc  Analysis  and  Synthesis,"  ASME  Journal  of 
Engineering  for  Industry,  Nov.  1972,  pp.  1193-1205 

6.  Sunada,  W. ,  and  Dubowsky,  S.,  "The  Application  of  Finite  Element  Methods 
to  the  Dynamic  Analysis  of  Flexible  Spatial  and  Co-planar  Linkage 
Systems,"  ASME  Journal  of  Mechanical  Design,  July  1981,  Vol.  103, 

pp.  643-651. 

7.  Shabana,  A.  and  Wehage ,  R.A. ,  "Variable  Degree  of  Freedom  Component  Mode 
Analysis  of  Inertia-Variant  Flexible  Mechanical  Systems,"  ASME  paper  No, 
82-DET.  93,  Journal  of  Mechanisms,  Transmission  and  Automation  in 
Design,  to  appear, 

8.  Shabana,  A.,  and  Wehage,  R.A. ,  "A  Coordinate  Reduction  Technique  for 
Dynamic  Analysis  of  Spatial  Substructures  with  Large  Angular  Rotations," 
Journal  of  Strucutral  Mechanics ,  to  appear,  Dec.  1983. 

9.  Shabana,  A. A.  and  Wehage  R.A. ,  "Spatial  Transient  Analysis  of  Inertia- 
Variant  Flexible  Mechanical  Systems",  Submitted  to  ASME  Journal  of 
Mechanical  Design,  Jan.  1983. 

10.  Klanner,  R. ,  "Cross-Country  Truck  for  Fast  Off-Highway  Navigation,  6th 
International  Conference  of  International  Society  of  Terrain  and  Vehicle 
Systems,"  Vienna,  Aug,  1978,  pp.  22-25. 

11.  Wehage,  R.A.  and  Haug,  E.J.,  "Generalized  Coordinate  Partitioning  for 
Dimension  Reduction  in  Analysis  of  Constrained  Dynamic  Systems,"  ASME 
Journal  of  Mechanical  Design,  Vol.  104,  1982,  pp*  247-255* 

12.  Goldstein,  H.,  Classical  Mechanics,  Addison  Wesley,  1980. 

13.  Haug,  E.J.  and  Arora,  J.S.,  Applied  Optimal  Design,  Wiley  1979. 


142 


APPLICATION  OF  SYMBOLIC  COMPUTATION 
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B.  Noble,  Mathematics  Research  Center,  University  of  Wisconsin 


SUMMARY* 

This  paper  illustrates  the  application  of  symbolic  computation  in  connection  with 
three  aspects  of  mechanical  systems; 

1.  The  derivation  of  dynamical  equations  by  Lagrangian  methods. 

2.  The  analysis  and  synthesis  of  kinematic  mechanisms. 

3.  A  robot  manipulator  arm. 


INTRODUCTION 

This  paper  illustrates  the  potential  of  symbolic  computation  in  connection  with  the  formula¬ 
tion  and  analysis  of  equations  for  dynamical  systems,  sensitivity  analysis,  linkages  and  mecha¬ 
nisms,  and  robot  manipulator  arms. 

*  This  is  a  preliminary  version  of  a  paper  to  appear  in  the  proceedings  of  NATO  Conference  on  Mechanisms, 
University  of  Iowa,  1983. 
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We  use  MACSYMA  (project  MAC’s  SYmbolic  MAnipulation  system) ,  a  large-scale  com¬ 
puter  program  for  symbolic  mathematical  computation.  MACSYMA  can  handle  polynomial 
manipulation,  simplification  and  substitution  with  symbolic  expressions,  symbolic  solution  of 
algebraic  and  differential  equations,  and  matrix  manipulation.  Although  we  have  found 
MACSYMA  particularly  convenient  to  use,  other  symbolic  programs  such  as  REDUCE  and 
SMP  could  be  used. 

This  report  deals  with  three  topics: 

1.  The  derivation  of  dynamical  equations  by  Lagrangian  methods,  including  sensitivity 
analysis  (Sections  1,  2,  3,  4,  and  5). 

2.  The  analysis  and  synthesis  of  kinematic  mechanisms,  including  dual-number  quatern¬ 
ions  (Sections  7  and  8). 

3.  The  direct  and  inverse  problem  involving  robot  manipulator  arms  (Sections  9  and  10). 

In  order  to  make  the  presentation  clearer  to  the  general  reader  who  lacks  specialized 
knowledge  of  symbolic  manipulation,  we  explain  the  mathematical  aspects  in  the  main  text 
(namely  the  kind  of  problem  for  which  we  feel  symbolic  computation  is  useful),  and  give  the 
detailed  MACSYMA  programs  in  appendices. 

In  a  certain  sense  the  real  “meat”  of  this  paper  is  the  detailed  programs  which  appear  in 
the  appendices.  The  reader  interested  in  symbolic  manipulation  should  solve  the  problems 
outlined  in  the  text  using  MACSYMA,  or  any  other  suitable  program,  with  the  appendices  as 
a  guide. 

The  objective  of  this  paper  is  to  encourage  the  use  of  symbolic  manipulation  in  the  analy¬ 
sis  of  mechanical  systems.  It  is  clear  that  the  complexity  of  the  problems  being  tackled  is  in¬ 
creasing  to  the  point  where  symbolic  manipulation  must  play  an  important  role  in  their  for¬ 
mulation  and  solution.  In  this  paper  we  have  simply  picked  out  the  tedious  parts  of  well 
known  methods  and  examples,  and  illustrated  the  ease  of  performing  the  manipulation  using 
MACSYMA. 


1.  DESIGN  OF  A  5-DEGREES-OF-FREEDOM  VEHICLE  SUSPENSION 

The  objective  of  this  example  is  to  illustrate  how  MACSYMA  handles  Lagrange’s  equation 
of  motion  in  the  form: 


d_ 
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_1T 

a  9,  a«,  u> 


/  =  1 ,  •  •  •  ,  n 


(1) 


where  T  and  V  are  quadratic  forms  representing  kinetic  energy  and  potential  energy,  respec¬ 
tively,  expressed  in  terms  of  generalized  coordinates  qh  Q,  are  nonconservative  generalized 
forces.  Consider  the  5-degrees-of-freedom  model  of  a  vehicle  suspension  system  shown  in 
Figure  1  and  dealt  with  in  Haug  and  Arora  [6]  (pp.  25,  200): 


T=  y  m,Z]  +  y  m2Z]  +  y  IZ]  +  y  mAZ]  +  y  msZ] 
V  =  |  K,d]  +  y  +  y  K}d]  +  y  K,d]  +  y  K,d] 


(2) 
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where 

d\=  Z2  +  Zj-  Z| 

d2  —  Z$  —  Z2  — 

d3  =  Zs  -  Z2  +  ~  Zj 

«/«-  Z4-/2W 

d$  =  Zs  ~  f\U) 

Zi 


The  Qj  are  found  from 

S  W  —  £  QfiZj  =  —  —  Cidfidi  ■  •  *  —  05^56^5  (3) 

/-i 

The  MACSYMA  program  and  output  for  the  above  problem  is  given  in  its  entirety  in 
Appendix  I.  We  comment  on  the  key  commands.  (C2)  establishes  the  vector  Q  of  general¬ 
ized  coordinates  Z1,  ■  *  Z5.  (C3)  establishes  the  dependence  of  the  elements  of  Q  on  time. 
(C4MC7)  establish  mass,  spring,  damping  and  displacement  vectors  [for  DISP  equations  fol¬ 
lowing  (2)  above].  (C11),(C12)  derive  the  generalized  forces  Qn  (=QQ(n)  in  the  program)  by 
picking  out  the  coefficients  of  SZ„  (=DEL(Q(N))  in  the  program)  in  8  W  in  (3)  above  (=DW 
in  the  program).  (C9),(C10)  establish  kinetic  and  potential  energies  defined  in  (2)  above. 
(Cl 3)  forms  and  displays  the  equations  of  motion  by  evaluating  (1)  above.  The  key  com¬ 
mand  here  is  DIFF(EXPR,T)  where  EXPR  is  some  function  of  7,  which  takes  the  derivative  of 
EXPR  with  respect  to  T.  Thus 

DIFF(DIFF(TT,DIFF(Q(N),T)),T)  =  -7- 

at 


As  requested,  the  computer  then  displays  the  equation,  of  which  we  have  shown  only  the 
firsL,  namely 

m,Z,  +  c,Z,  -  c,Z2  -  -jj  c,Z3  +  KXZ,  -  K2Z2  -  ~  K,Z,  =  0 
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2.  SLIDER  CRANK  PROBLEM 


This  example  illustrates  the  derivation  of  equations  of  motion  when  constraints  are 
present.  The  appropriate  Lagrange  equations  are: 

f  |i|.|i  +  4*:  +  (J»rx.a  (4) 

Bl  dcfr]  bit,  la«, 


where  X,  V,  and  Q,  are  as  previously  defined  and  the  constraints  are  represented  by  k  alge- 
braic  equations: 

$(<?)  =  G 

and  \  are  k  values  of  Lagrange  multipliers  or  undetermined  coefficients. 


Figure  2. 


For  the  slider  crank  mechanism  shown  in  Figure  2  we  have 
q  =  l<J>[)X2,y2i<l>2*xl)T 

X=  y  Mf  +  y  m2{xl  +yl)  +  y  Mi  +  y 

8M/  =  /(/)8x3 

<!>!  =  /•  sin<£  \  —  y2  “  l  sin02  —  0  (5) 

0>2  =  r  cos^j  -  (x2-  /cos</>2)  =  0 
#3  —  *2  +  /  COS02  —  *3  —  0  t 
O4  =  y2  "l"  f  sin^>2  —  G 

Again  the  procedure  outlined  is  easily  handled  by  MACSYMA  (see  the  symbolic  program 
given  in  Appendix  11).  From  the  output  of  this  program  we  have  the  following  equations  of 
motion  for  the  system: 

+  [k[  r  cos</>i  -  k2r  sin^J  =  0 

m2x2  +  [— X2  4*  X.3]  =  0 

m^y '2  +  [ —  A 1  +  =  0  (6) 

J2<j>2  +  [Xi  /  cos 02  —  k2  I  sin</>2  —  M  /  sin^>2  +  A4  /  cos^2l  =  0 

W3X3  -  X3  =  /(/) 
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Note  that  (5) ,(6)  are  a  differential-algebraic  system  consisting  of  nine  equations  in  nine  un 
knowns. 


This  problem  has  only  a  single  degree  of  freedom.  If  this  is  chosen  as  we  find  from 
(5)  that  *2,  yi,  0 2>  anci  *3  can  be  expressed  in  terms  of  <f>\.  If  these  are  substituted  in  the 
kinetic  energy  (=7T)  expression  we  find  TT  —  7T(0i,$i).  The  equation  of  motion  is  now 
given  by 


A  ( <LZZ  _  QjZ 

dt  |  60]  1 

i.e.,  one  single  differential  equation  in  one  unknown,  with  no  constraints, 
program  for  deriving  this  equation  is  given  in  Appendix  III. 


(7) 

A  MACSYMA 


Equation  (7)  must  be  equivalent  to  nine  equations  in  (5), (6),  though  derived  indepen¬ 
dently.  To  deduce  (7)  directly  from  (6)  we  can  proceed  as  follows: 


Write  (6)  and  (5)  in  matrix  notation  as 


Mq+  AT(q)k  =  /  (8) 

<*>($)  =  0  (9) 

A(q)  is  a  4x5  matrix,  so  that  the  equation  A  (q)x  =*  0  has  a  solution  of  the  form 
x  =  Cx0(q),  where  C  is  an  arbitrary  constant.  Multiplying  (8)  by  x^  obtaining 

XqM  q  =  X(J  / 


expresses  xo(q)  in  terms  of  (f>\  only. 
(7). 


This  leads  to  the  single  differential  equation  given  by 


In  this  problem,  another  approach  would  be  to  choose  two  generalized  coordinates  with 
one  side  constraint.  This  would  lead  to  two  ordinary  differential  equations  involving  02 
and  one  Lagrange’s  multiplier  and  one  side  constraint.  These  can  be  obtained  either  directly 
[as  (7)]  or  by  eliminating  three  of  the  X’s  in  (6). 


3.  JACOBIANS 


If,  instead  of  looking  at  specific  examples  as  in  the  last  two  sections,  we  consider  general 
formulations,  then  the  following  type  of  situation  arises.  Suppose  that  cartesian  x- 
components  x,  (/  =  1 ,  ■  •  •  ,  n)  depend  on  generalized  coordinates  q}  0=1,  ■  ■  •  ,p).  Then 


*'  5,  H  9j 

p  p  p 

x,  =  £  Qj  +  X  X 


B2Xj 


A  H  A  A 


The  quantity  d<£>/dq  in  (4)  has  the  same  form  as  the  Jacobian  [bxjbqj]  occurring  above,  and 
the  MACSYMA  command  for  b<f>/bq  is  given  in  the  last  line  of  Appendix  II. 


It  is  convenient  to  use  the  MACSYMA  subroutine,  or  BLOCK  to  obtain  b2xi/dqJdqk ,  and 
this  is  done  in  Appendix  IV. 
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4.  SENSITIVITY  ANALYSIS 


The  objective  of  this  section  is  to  illustrate  how  MACSYMA  deals  with  some  aspects  of 
sensitivity  analysis,  with  particular  reference  to  the  paper  by  Haug  and  Ehle  [7],  Consider  a 
dynamical  system  described  by  design  variables  b=lbl%  •  •  •  ,  bk]T  and  a  state  variable 
z(t)  —  [zjC/) ,  •  •  •  ,zk(t)]  T  which  is  the  solution  of  an  initial  value  problem  of  the  form 

i  —  f(z,b)  ,  •  0  <  t  <  T 

z( 0)  -  h(b) 


where  i  —  dz/  dt  and  T  is  determined  by  the  condition 
A(7\z(70)-0 


A  typical  function  that  may  arise  in  a  design  formulation  is 

c  T 

iff  -  g(z(T),b)  +  I  F(t,z,b)dt  (10) 

Jo 

It  is  required  to  find  dififdb ,  which  is  a  k -vector.  This  is  done  by  considering  an  adjoint  vari¬ 
able  X  satisfying 

\  +  J  jk  -  Fj 

and  then 

T 

*  Sb  “  kT(0)hb  +  Jf  C Fb-kTfb)dt  (11) 

The  procedure  outlined  above  is  carried  out  by  the  MACSYMA  procedure  given  in 
Appendix  V.  This  also  illustrates  the  MACSYMA  solution  of  linear  equations  by  Laplace 
transform.  Consider  an  example  given  in  Ref.  [71,  namely  a  simple  oscillator  governed  by 
the  equation 


x  +  lac  ~  0  0  <  /  <  t r/2 

jc(0)  =  0  jc(0)  =  v 

with  if;  -  x(tt/2 ),  b  =  [k,v]T  =  [bub2]T. 

The  results  of  the  MACSYMA  procedure  give  ihe  first  derivative  of  the  functional  ip  as 


(12) 


it. 

db 


2  bV1 


sin 


7 T  TT  b2  TT^J~b\ 


+ 


sin 


4^i 

Vyfb[ 


cos 


(13) 


Higher-order  sensitivity  analysis  requires  the  Jacobian  for  which  a  BLOCK  MACSYMA 
command  is  given  in  Appendix  IV  as  discussed  in  the  last  section. 


Note;  MACSYMA  is  awkward  for  differentiating  functions  having  a  definite  integral;  e.g., 
from  (10)  we  have 

T 

=  gl[zb(T)  +  z(T)(T)b]+ gb+  f  [F,zb  +  Fb)dt  +  F(T)(T)b  (14) 

UO 

However,  MACSYMA  does  not  perform  the  derivative  under  the  integral  sign  (a  possible  di¬ 
alogue  with  MACSYMA  is  given  in  Appendix  VI). 
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5.  A  SPACECRAFT  PROBLEM 

Levinson  [10]  has  described  in  detail  an  application  of  the  symbolic  language  FORMAC  to 
formulate  the  spacecraft  problem  shown  in  Figure  3,  consisting  of  two  rigid  bodies  with  a 
common  axis  of  rotation  b . 


Figure  3. 

The  equations  are  given  in  Ref.  [10]  in  complete  detail,  and  are  translated  into 
MACSYMA  in  Appendix  VII.  To  illustrate  the  point  we  give  typical  equations  with 
MACSYMA  equivalents; 


Equations  from  Ref.  [10] 

MACSYMA 

n 

=  cosg  b2  +  sing  63 

a) 

R12J  :CQS  (Q)  *  B  [21 + SIN  (Q)  *  B[3] ; 

—  U\  bi  +  u2  b2  +  W3  63 

(3) 

WB:U[1]*B[1]  +  U[2i*B[2]  +  U[3]*B[3] 

"4 

=  k 

U[4]:DIFF(Q,T); 

=  4 

at 

(7) 

We  could  implement  this  last  mathematical  expression  (7)  [10]  by  converting  the  vector 
product  into  matrix  form  but  it  was  simpler  to  write  a  BLOCK  function  to  do  this,  as  in 
Appendix  VII.  The  MACSYMA  equivalent  of  (7)  is  now 

ALPR:D1FF(WR,T)  +CROSS(WB,WR); 

We  discuss  only  one  other  correspondence.  Equation  (27)  in  Ref.  [10]  is 

becomes  in  MACSYMA 

i 

F(R):-DOT(DIFF(VBS,U[R]),FB) 

+DOT(DIFF(WB,U[R]),TB); 

Dot  in  the  above  is  defined  by  another  block  in  Appendix  VII.  The  distinction  between  := 
and  as  used  in  this  command,  is  discussed  in  Appendix  B. 

The  complete  set  of  equations  given  in  Ref.  [10]  is  generated  by  Appendix  Vll.  The 
reader  should  compare  the  corresponding  FORMAC  program  given  in  Levinson  [10]. 
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6.  AN  EXAMPLE  OF  MANIPULATION  AND  SIMPLIFICATION  USING  MACSYMA 


In  previous  sections  we  have  not  found  it  necessary  to  use  powerful  commands  in 
MACSYMA  concerned  with  the  simplification  of  complicated  expressions.  As  an  introduction 
to  the  manipulation  needed  in  the  later  sections,  we  present  the  following  simple  dialog: 

(C2) 

F:(X  +  Y+Z)'2/Y; 

(D2) 

(Z+Y+X)2 

Y 

<C3) 

EXPAND(%); 

(D3) 

Y  +  +  2Z  +  Y  +  Y  +  2X 

(C4) 

COMBINED); 

(D4) 

Z2  +  2XZ  +  X2  +  az  +  Y  +  2X 

(C5) 

XTHRU(%); 

' 

(D5) 

Z2  +  Y  (2Z  +  Y  +  2X)  +  2XZ  +  X2 

Y 

<C6) 

RATSIMP(%); 

(D6) 

Z2  +  (2Y  +  2X)  Z  +  Ya  +  2XY  +  X2 

Y 

<C7) 

EV(%,Z"0); 

(D7) 

Ya  +  2XY  +  X2 

Y 

(C8) 

SUBST(SIN(2*TH),X,%); 

(D8) 

Ya+2SIN(2TH)  Y  +  SINa(2TH) 

Y 

(09) 

TRIGEXPAND(%); 

(D9) 

Yf 

+  4  COS  (TH)  SIN  (TH)  Y  +  4  COS2(TH)  SIN2(TH) 

Y 

(CIO)  TRIGREDUCE(%); 

(DIO) 

Y  _  +  2 SIN(2TH) 
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7.  THE  FOUR-BAR  LINKAGE  COUPLER  CURVE 

The  objective  of  this  section  is  to  illustrate  how  MACSYMA  performs  algebraic  and  trigo- 
nometric  manipulations  encountered  in  the  analysis  and  synthesis  of  linkages. 


Figure  4, 

Consider  first,  following  Hartenberg  and  Denavit  [5]  (p.  150),  the  four-bar  linkage 
AO A  OqB  shown  in  Figure  4,  where  the  bars  lie  in  a  plane  and  are  pin-jointed  at  A,  0A ,  0B , 
and  B,  and  the  positions  of  0A  and  0B  are  fixed.  MAB  is  a  lamina,  so  that  M  is  fixed  rela¬ 
tive  to  A  and  B.  If  BOb  is  rotated  about  0B ,  the  point  M  will  trace  a  planar  curve,  the  equa¬ 
tion  of  which  we  wish  to  determine. 

Using  the  linkage  parameter  shown  in  Figure  4  we  have: 
x' —  x—  bcosO  x”  =  x  —  a  cos  (0  +  y) 

y'  =  y  —  6  sin#  y”  =  y  -  z  sin  (0  +  y)  (15) 

r 2  —  x'2  —  ya  —  0  s2  —  (x"—  p )2  —  y”2  =  0 

The  required  equation  for  the  motion  of  M(x,y)  is  obtained  by  eliminating  (x\/),  (x"yf), 
and  0  from  (15).  This  is  done  in  Appendix  VIII.  Elimination  of  (x\/)  and  (x",/')  leads  to 
the  equation  of  the  form 

N  cos0  —  L  sin0  =  0 
— PcosO  +  M  sinO  =  0 

where  L  =  (x  —  p)  siny  —  y  cosy  ,  N  =  (x  —  p)  cosy  +  y  siny 
M  -  y  ,  P  =  —x 

0  -  -jj  0 ’1  +  xl- r1- b2),  >li  =  ~  [(jc-p)2+/+a2-i2| 

Eliminating  9  from  (16)  gives 

(P0  +  iV0)2+  (Af0  +  L0)2=  ( LP—MN )2  (17) 

This  sixth-degree  polynomial  in  (x,y)  is  called  the  tricircular  sextic.  The  determinant  of  (16) 
vanishes  when  LP—  NM  =  0,  i.e., 
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(18) 


x(x-p)  +  y2  -  py  coty  =  0 

The  above  equation  is  called  a  circle  of  singular  foci. 

We  next  derive  a  basic  relation  used  to  synthesize  four-bar  linkages,  namely  the  so-called 
displacement  equation  which  gives  the  output  angle  if/  for  a  given  input  angle  <f>  in  Figure  5. 


B  (xhy2) 


Figure  5. 

x2  —  cos  0 ,  y2  =  ui  sin0 

*3  *  “04  +  <23  cost// ,  y2  —  tf3sim// 

-  (x2-  X3)2  -  (yi-yi)2  -  0 
Eliminating  (*2^2)  an(*  (*3>.V3)  from  (19)  leads  to 
A  sin \jj  +  B  cos ^  =  C 

where  A  =  2<31a3sin<£ ,  B  —  la  103  cos<£  +  2^304 
C  =  2^^4  0080  +  {.a}  +  a2  -  a\  +  a2) 

To  solve  (20)  for  1 (j  set 


.  ,  2  lam/// 2  ,  1  -  tan2i///2 

sin  ^  /  cos  \fj  - - 

1  4-  tan^/2  1  +  tan2i///2 

Substitution  into  (20)  leads  to  a  quadratic  in  tarn//,  the  solution  of  which  is 


(19) 


(20) 


(21) 


tan  "jf = 


tff+C) 


l  +  B2-C2 


The  two  solutions  correspond  to  the  two  ways  to  close  the  four-bar  linkage  shown  in  Fig¬ 
ure  6. 


152 


Figure  6. 

For  the  purpose  of  synthesis,  (20)  can  be  rewritten  as: 

K\Cos<f>  -  £2cosi/»  +  A:3  =  cos (22) 

with  K\  —  ““  ,  K-2  =  a%  =*  (a}  +  a$  +  a} -2a\QiK^)l a\az  (23) 

^3  ®  1 

Hartenberg  and  Denavit  [5]  (p.  297)  discuss  the  problem  of  designing  a  planar  four-bar 
linkage  such  that,  to  three  given  positions  fa9  fa,  and  fa  of  the  crank  OaA  ,  there  correspond 
three  prescribed  positions  \fs2 ,  and  «/>3  of  the  follower  0#B.  The  form  of  (22)  is  well- 
suited  for  this  purpose.  The  solution  in  this  case  is  obtained  by  solving  the  set  of  three 
simultaneous  equations  for  Ku  K2,  and  Ky  obtained  by  substituting  =*<£,,  ^  =  ^/}  /  =  1,2,3 
in  (22),  and  then  obtaining  a3,  a\9  and  a2  from  (23)  (<24  can  be  selected  equal  to  one). 
Appendices  VIII  and  IX  give  the  MACSYMA  program  Lo  carry  out  the  above  procedures. 
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8.  DUAL-NUMBER  QUATERNIONS 

Next,  consider  a  laborious  calculation  contained  in  the  appendix  to  the  Yang  and  Freuden- 
stein  paper  [19]  in  connection  with  the  analysis  of  a  spatial  four-bar  mechanism.  We  are 
given 

A  (2i)  sin04  +  P(0j)  cos04  =  C(6[)  (24) 

where  A  (&i)  =  sina12  sino^  sin§! 

27(0])  =  —  sin«34  (sina4i  cosai2  4-  cosa4i  sina^cosfli)  (25) 

C(0i)  —  cosa23  ~  cosa34  (cosa4i cosa^  —  sino^i  sinai2cos0i) 


Here 

«12  =  <X\2  +  €an  » 

(*23  =  «23  +  €023  , 
0:34  =  «34  +  €fl34  , 


§1  —  +  C-Sn 

02  —  #2  +  €S2 
03  =  ^3  4"  €$3 


a4i  —  a4 1  4-  €#41  »  04  =  04  4-  €S4 

where  €  is  a  symbol  with  the  property  e2=s  0.  This  implies  that,  if  0  =  0  4- €5,  then 
sin0  ~  sin0  4-  es  cos 0  ,  cos0  ~  cos0  —  es  sin0  . 

It  is  then  clear  that  (24)  can  be  reduced  to  the  form 


(26) 


P4-€Q  =  R  +  eS  (27) 

where  P,  Q,  R,  and  S  are  independent  of  e.  It  is  required  to  find  the  explicit  form  of  P,  Q, 
R,  and  S.  To  calculate  this  by  hand  is  extremely  laborious,  but  straightforward  in  MACSY- 
MA.  The  program  is  given  in  Appendix  X. 

Three-dimensional  problems  in  kinematics  and  dynamics  involve  laborious  calculations  in¬ 
volving  Euler  angles  and  Euler  parameters.  (See,  for  instance,  Nikravesh,  et  al.  [13],  and 
Wittenburg  [18].)  These  calculations  are  easily  handled  in  MACSYMA  in  a  routine  fashion. 
The  techniques  involved  are  illustrated  in  connection  with  other  examples  in  this  paper,  so  we 
do  not  elaborate  further. 
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9.  ROBOT  ARMS  -  THE  DIRECT  PROBLEM 


Robot  arm  manipulators  can  be  considered  to  consist  of  a  series  of  links  connected  togeth¬ 
er  by  joints.  It  is  convenient  to  use  cartesian  coordinates  by  assigning  a  separate  coordinate 
frame  to  each  link.  Without  going  into  detail  (which  can  be  found  in  Paul  [15],  for  instance), 
the  relation  between  the  coordinate  frames  assigned  to  one  link  and  the  next,  consisting  of 
translations  and  rotations,  can  be  derived  by  a  4x4  matrix  of  the  form 

nx  ox  ax  px] 

Ify  Oy  Oy 

A~  nz  o2  az 

.0  0  0 

where  the  elements  of  the  top  left  3x3  submatrix  are  the  direction  cosines  representing  the 
rotations  and  (px  ,py  tpz)  is  the  translation. 

The  position  and  orientation  of  the  coordinate  frame  of  the  end  of  the  manipulator  is 
specified  by  six  parameters  (3  translations,  3  rotations).  A  general  manipulator  can  be 
designed  using  six  links,  each  having  one  degree  of  freedom.  If  T6  denotes  the  A  -matrix 
corresponding  to  the  end  of  the  manipulator,  and  (/—  1,  *  •  •  ,6)  are  the  ^-matrices  for 
the  individual  links,  Te  is  given  terms  of  the  A ,  by 


Py 

Pz 

1 


(28) 


T6  *  A\A2A3  a4a5a6 
A  typical  A  -matrix  for  a  link  is 
co$02  0  sin02  0 
sin02  0  -cos02  0 
0  1  0  d2 

0  0  0  1 


If  d2  is  fixed  and  02  is  a  variable  representing  rotation  of  the  second  link,  this  is  called  a 
revolute  joint.  If  02  is  fixed  and  the  translation  d2  is  varying,  this  is  called  a  prismatic  joint. 


The  so-called  “direct”  problem  is:  given  the  Ah  find  T^— this  is  obviously  straightfor¬ 
ward,  although  algebraically  laborious  (see  Paul  [15],  p.  59  and  Appendix  X) 


The  main  computational  problem  connected  with  the  direct  problem  is  the  question  of 
differential  motion  discussed  in  Paul  [15]  (Chapter  4).  These  are  important  in  connection 
with  dynamic  analysis  of  manipulators,  sensitivity  analyses,  and  small  adjustments  of  the  end 
manipulator. 

Without  going  into  detail  (which  can  be  found  in  Ref.  [15]),  the  computational  problem 
involved  is  the  following.  Suppose  that  the  six  parameters  representing  degrees  of  freedom 
are  denoted  by  a  6-vector  x,  small  changes  in  these  parameters  are  denoted  by  A*,  and  the 
corresponding  small  changes  in  the  three  displacements  and  three  rotational  parameters  of  the 
end  point  frame  of  the  manipulator  are  denoted  by  the  6-vector  A ,  then  we  have  a  relation  of 
the  form 


A  A  =  J  Ax 


where  the  /Lh  column  of  J  is 


4 

«/ 


;  where,  for  /  —  1 ,  •  ■  *  ,6,  and  for  a  revolute  joint, 
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(-“ixPiy  +  "iyPb) 

nz 

4- 

(” °ixPfy  +  °iyPix) 

8 

°z 

(—  QixPjy  +  aiyPjx) 

Qz 

and  for  a  prismatic  joint 


The  MACSYMA  program  for  the  symbolic  computation  of  A  and  the  numerical  example  in 
Ref.  115]  are  given  in  Appendix  XI  (see  Ref.  [15],  pp.  104-107). 
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10.  ROBOT  ARMS  -  THE  INVERSE  PROBLEM 

The  “inverse’*  problem  consists  of  obtaining  the  Ah  /'=1,  •••  ,6,  given  numerical 
values  of  T6.  In  theory  this  can  be  clone  from 


^1^2^3^4^5.46=  Ts  (29) 

which  gives  12  equations  in  6  unknowns  (the  6  degrees  of  freedom  of  the  links).  These 
equations  are  redundant,  and  there  are  only  six  independent  equations  in  the  six  unknowns. 
However,  the  equations  are  highly  complicated.  The  method  used  in  practice  is  to  consider 
also  the  following  equations  which  are  completely  equivalent  to  (29): 


A2A3A^AsA^^  A\X t6 
A3A4AsA6~  Az'AT'Tt 
A4A$  A$  ATlA2lAilT6 
A$A(,—  A4lA3*A  2^A\^  7$ 
A6=  A?lA4lA?lA2lAflT6 


(30) 


[In  practice  the  A~l  are  usually  easily  obtained  from  the  Ar\  Equations  (29)  and  (30)  give 
72  equations  for  the  6  unknowns.  The  procedure  is  now  to  pick  out  the  simplest  6  indepen¬ 
dent  equations  from  the  set  of  72.  The  simplest  solution  occurs  when  one  of  the  equations 
involves  only  one  unknown,  say  xh  another  equation  involves  x,  and  a  second  unknown  x2> 
a  third  equation  involves  only  xh  x2,  x3,  and  so  on.  The  system  can  then  be  solved  sequen¬ 
tially.  This  is  the  solution  with  Stanford  and  the  elbow  manipulators  described  by  Paul  [15]. 


A  more  complicated  situation  occurs  in  the  robot  arm  discussed  by  Lumelsky  [11],  where 
such  a  simple  sequence  of  equations  cannot  be  found.  Instead,  the  simplest  set  is  of  the  form 

X\  =  / 1  Gc3  ,*4)  xa  =  3  >*4) 

*3  =  /3<*l  >*2)  *4  =  /4(*L  1*?)  ^ 

These  can  be  solved  by  straightforward  iteration. 


In  Appendix  XII  we  give  a  MACSYMA  program  for  selecting-  the  basic  6  equations  from 
the  72  available.  This  can  be  done  automatically  by  using  the  command  FREEOF  to  print  out 
a  dependency  table  showing  which  of  the  variables  occur  in  each  of  the  72  equations.  Equa¬ 
tion  (31)  can  be  deduced  directly  from  this  dependency  table. 
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APPENDIX  I 

(G2)  Q:[Z1,Z2,Z3,Z-1,Z5|; 

(OS) 

(C3)  DEPENDS(Q,T); 

(D3) 

(C4)  MASS:tM1,Ma,l,M4,M5]; 

(D4) 

(C5)  SPRING  :tK1,K2,K3,K4,K5]; 

ID5) 

(C6)  DASH:[CC1.CC2,CC3,CC4,CC5); 
(D6) 


[Z1,Z2,Z3,Z4,Z5] 
[Z1  (T) ,  Z2(T) .  Z3(T) .  Z4(T) .  Z5  (T)) 
[M1,M2,I,M4,M5] 
|K1,K2,K3,K4,KS1 

[CC1 ,  CC2,  CCS,  CC4,  CC5l 


(C?)  0l3Pi[Z2  +  L/12*Z3-Z1,Z4-Z2-L/3*Z3,Z5-Z2  +  2*L/3*Z3,24-F1(T).Z5^F2(T)1; 
(07) 


^  +  Z2-Z1,  Z4--^|^-Z2,  Z5  +  -^r^-Z2,  Z4-FKT),  Z5-F2(T) 

12  s5  o 


(C8)  DERIVABBREViTRUE; 

(08)  TRUE 

(C9)  TT:1/2*(MASS.D1FF(0,T)-2): 

M5(Z5t)2  +  M4(Z4,)2  4-  l(Z3,)2  +  M2(Z2T)S  +  M 1  <Z1,)2 
(09)  - ! - ! - t- - 1 - — 


(CIO)  P0T:1/2'(SPRING . DISP‘2); 


(DIO) 


K3  Z5  + 


2LZ3 


-Zal'  +  KS  [z5-F2ml2-»-K2|z4--“-Z2 


+  K4  |z4  —  F1(T)|2  +  K1  1^-  +  Z2-Z1  /2 


(C11)  DW:-OIFF(DISP,T)*DIFF(DISP); 


(Dll) 


LZ3t 


-  +  Z2t-Z1t  DEL(Z2) 


LDEU(Z3) 

12 


-DEL(ZI) 


LZ3, 


Z3DEL(L)  I 


-  Z4t  — 


iT-  +  Z2t-z1t)  DEL(T)  +  ~  12 

LZ3t 


T“  3 


LZ3* 


-Z2t  DEL(Z4)—  DEL(Z2)  — 


Z3DEL(L)  | 


+  Z4T - - Z2TOEL(T) - 3 


2LZ3t 


L  DEU(Z3) 


2LDEL(Z3) 


-  |ZST  4- - ^  ~  Z2J  DEL(Z5)  -  0EL(22)  +  3 


2LZ3y 


2Z3  DEL(L)  I 


+  |25t + —3^  -  22t|  del  (T> + 

-|Z4T-F1(T)Tj  JdEL(Z4)  +  |z4t-F1(T)tJ  D6L(T)J  , 

■  -  (z5t-  F2CT)t]  |DEL(Z5)  +  [Z5T -  F2(T)T|  DEL(T)|] 

(Cl 2)  FORNTHRU5DOQQ[N]:RATCOEF(DW,DEL(Q[N|)); 

(D12) 

(Cl 3)  FOR  N  THRU  5  DO  LDISPlAY(EQUATION(N] 

=  OIFF(DIFF(TT,DIFF(OlN).T)),T)-DtFFnT,a[N))+OIFF(POT.QtN])-OASH.OO(N]); 


OONE 


LZ3 


'TT 


CC1  (LZ3t+12Z2t-  12Z1t) 


(E14)  EQUATION^  = 


-K3  Z5  + 


2LZ3 


-Z2-2K2  Z4--^  +  2K1 


12 


-+Z2-Z1 


M2Z2tt  — 


CC3  (3  Z5t  +  2  L  Z3t-  3 Z2T)  CC2  (3 Z4T -  L Z3j -  3Z2T) 


CC1  (LZ3t+  12Z2t-  12Z1t) 


(El  5)  EQUATION 3  * 

4K3l|z6+'?^-Z2|  ZKZujzA--^!2— Z2|  K1L  |^  +  Z2- Z1 
3  ~  3  ~  ~8 


■HZSr-r- 


CC  2  (3  L  Z4t  -  L2  Z3t  -  3  L  Z2t) 


CCS  (6  L  Z5t  +  4  L2  Z3y  -  0  LZ2?)  CC 1  (L2  Z3T  + 12  L  Z2T  -  1 2  L  Z1?) 


(El 6)  EQUATION4-M4Z4TT  +  - 
CC2(3Z4t-LZ3j-3Z2t) 


2K2  24-“-- Z2  +2K4(Z4-FKT)] 


-CC4  |-Z4t+F1(T)t] 


2K3  Z5  +  2L3Z3  -Z2  +  2  K5  |Z5—  F2(T)J 
(El 7)  EQUATtONg  "  M«Z5tT  +  ~ - 2 - — 

CC3  f3Z5T  +  2LZ3T-  3Z2t) 

- - - CCS  |-ZST  +  F2(T)jj 


/*  APPENDIX  I  (CONT.)  MACSYMA  PROGRAM  */ 

/* . GENERALIZED  COORDINATES...*/ 

Q;[Z1,Z2,Z3,Z4(Z5|; 

DEPENDS(QJ); 

/* GENERALIZED  MASS . */ 

MASS:[Ml.M2.I.M4,M6l: 

/* SPRING  CONSTANT . V 

SPRING:|K1.K2.K3,K4,K5); 

/* . DAMPING  CONSTANT . V 

DASH!(CC1,CC2.CC3,CC4,CC5]; 

/* . GENERILIZED  DISPLACEMENT..*/ 

DI9P:(Z2 -f L/12*Z3-Z1  ,Z4*Z2-L/3*Z3,Z5-Z2  + 2*L/3*Z3,Z4-F1  (T)  .Z5-F2(T)1 : 
DERIVABBREV:TRUE; 

/•.....KINETIC  AND  POTENTIAL  ENERGIES..*/ 

TT:1/2*MASS.DIFF(Q.T)*2; 

POT:  1/2*SPRING. DISP‘2; 

r . DW . */ 

DW  :-DIFF(DlSP.T)  "DlFF(DlSP) ; 

FOR  N  THRU  S  DO  QQ[N]:RATCOEF(DW,DEL(Q[Nl)); 

/* . EQUATIONS  OF  MOTION . */ 

FOR  N;1  THRU  5  DO  LDI9PLAY  (EQUAT)ON[N|- 
DIFF(DIFF(TT,DIFF(Q(NJ,T)),T) 

-D1FF(TT,Q(N)) 

+  DIFF(P0T,Q[N|) 

-DASH.OQ(NI); 


APPENDIX  II 

/*  CO-ORDINATES  */ 

Q:[PH1,X2,Y2.PH2.X31; 

OEPENDS(QJ); 

MASS:(J1,M2,M2,J2,M3]; 

CONSTRAINT  :(R*S1N(PH1)-Y2+  L*SIN(PH2), 

R‘COS(PH1)-X2  +  L*COS(PH2). 

X2  +  L*COS(PH2)-X3, 

Y2  +  L*SIN(PH2)I; 

/*  LAGRANGE  MULTIPLIERS  */ 

LAM:[LAM1.LAM2,LAM3,LAM4); 

/*  KINETIC  ENERGY  */ 

TT;(1/2*DIFF(O.T)'2).MASS; 

/*  EQUATION  OF  MOTION  V 

FOR  1:1  THRU  S  DO  LDISPLAY  (QQ(l|“DIFF(0lFF(TT,DIFF(Q[l1,T)),T) 
-DlFF(TT.QtlJ)  + 

LAM.(DIFF(CONSTflAINT.Qm))): . 


A- 
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APPENDIX  III 

r  ALTERNATE  WAY  TO  DO  SLIDER  CRANK  PROBLEM  7 

Q:lPH1,X2.Y2.PH2,X3): 

DEPENDS(O.T); 

Y2:1/2*fi*31N(PH1); 

X2:R*COS(PH1)  +  L*SQRT(1-R**2/(4*L-2)*SIN(PH1»; 
X3:X2+L*C05(PH1); 

PH2:ASIN(R/(2'L)*SIN(PH1|); 

MA5S:Ul.M2.M2,J2.M3); 

DERIVAB0REV:TRUE; 

TT:(1/2*DIFP(O.T)'2).MA8S; 

TT:EV(TT,DIFF); 

EQUATION:DIFF(DIFF(n,piFF(PHl.T))J)-DlFF(TT,PH 


APPENDIX  IV 

r . TEST  FOR  JACOBIAN . HIGHER  DERIVATIVES.. 

DIMENSION  OF  A  X  AND  V  ARE  P  K  AND  M  RESPECTIVELY 
AND  SECOND  ORDER  DERIVATIES  ARE  FORMED  7 
jAC2(A,P,X.K,Y.M):-BLOCK( 

FOR  L:1  THRU  P  DO  ( 

DEPENDENT  :DETERMINANT(A[LJ1, 

FOR  1:1  THRU  K  DO  ( 

VARI A  B  L£  1 :  DETERM  IN  ANT(X  [ID . 

FOR  J:1  THRU  M  DO  ( 

VARIABLE2:DETERMINANT(Y[JD. 

PART(L1l,Jl:OlFF(DIFF(OEPENOENT1VARlA0LE1),VARlABLE2l)))); 

/*  FOLLOWING  IS  A  SIMPLE  EXAMPLE .  7 

A:MATRlX<[Zl«01**2-2*B1*Z31,[B2“2*Z2l,[3*B1**2*Z3-B1**3*Z1)): 

X:MATniX(tBl|,lB2|,(03|); 

Y:MATRIX«Z1UZ2],[Z3»; 

JAC2(A,3(X.3,Y,3); 

FOR  L-1  THRU  3  DO  (  FOR  1:1  THRU  3  DO  (  FOR  J:1  THRU  3  DO 
(LDISPLAY  (PART[L.I,J1)))); 

/'  IN  THE  FOLLOWING  CASE  A  HAS  DIMENSION  N  BY  M 

AND  B  HAS  P  BY  1  AND  FIRST  DERIVATIVES  ARE  FORMED.  7 
JAC3(A,N,M,0,P):-BLOCK(FOR  L:1  THRU  P  DO  I 
VARIABLE:DETERMINANT(B(LD, 

FOR  1:1  THRU  N  DO  I 
FOR  J:1  THRU  M  DO  ( 

DEPENDENT :  t  A(l ,  J]), 

PART[L.UI:DIFF(DEPENDENT, VARIABLE))))); 

AA:MATRlX([Zl**2*B3,Z2**2*Bl,Z3*B1I,[Z2‘*3*B31ZrZ2,ZrB3), 

[23*‘2-FZ2,Z2*02*Z3,Zl*93**2l); 

JAC3(AA,3,3.X,3); 

FOR  L;1  THRU  3  DO  ( 

FOR  1:1  THRU  3  DO  ( 

FOR  J:1  THRU  3  DO  LDISPLAYf 
PARTfLUD)); 


Z2(T); 

Z2(T):“”%; 

JACto.N.B.W-eLOCKHPARTj, 

FOR  1:1  THRU  N  00  ( 

DEPENDENT:DETERM!NANT(A[IJ), 

FOR  J:1  THRU  K  DO  ( 

VARIABLE:DETERMINANT(8[JD, 

PART[t,J!;OlFF(DEPENDENT, VARIABLE))), 

GENMATRIX(PART,N,K,1,1)): 

/*  NOW  WE  SOLVE  FOR  ADJOINT  VARIABLES  V 
DEPENDS(tLAM1,LAM2).[TD; 

LAM:MATRlX[[LAMl(T)l,tLAM2tT»); 

EQ1:DIFF(LAM,T)+TRANSP0SE(JAC(F.2,Z,2)).LAM; 

/*  SINCE  LAPLACE  TRANSFORM  SOLVES  WITH  INITIAL  VALUES 
ONLY  WE  ASUUME  ALPHA  AND  BETA  AS  THE  INITIAL  VALUES 
AND  SOLVE  FOR  ALPHA  AND  BETA  FROM  THE  FINAL  VALUES 
OF  THE  SOLUTON  V 
ATVALUEILAM 1  CT).T-0, ALPHA) ; 

ATVALUE(LAM2fn  ,T  -  Q.BETA) ; 

P1:DETERMINANT(EQ1[1]): 

P2:DETERMINANT(EQH2!); 

EQ2:LAPLACE(P1,T.S): 

EQ3:LAPLACE(P2.T,Sl; 

UNSOL  VE([EQ2,EQ3ULAPLACE(LAMim.T.S), 

LAPLACE(LAM2<T),T,S)]); 

PP1:RHS(FIRST<%)); 

PP2:RHS<LAST<%TH(2))); 

LAMHT):  —  ILTIPPI.S.T); 

LAM2(T):-ILT[PP2,S.T>; 

/*  THIS  THE  SOLUTION  OF  THE  ADJOINT  VARIABLES  7 
LAMlCT); 

LAM1(T):  — 

LAM2(T>; 

LAM2(T):-"K,: 

LINSOLVE«LAM1t%P1/2)  +  1,LAM2|%PI/2)l,lALPHA,BETA)).GL08ALSOLVE:TRUE; 
/*  ALPHA  BETA  HAVE  TO  BE  SUBSTITUTED  IN  THE  ABOVE  SOLUTION  7 
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/*  DESIGN  SENSITIVITY  ANALYSIS  7 
DEPENDS([T^1.[SI); 

DEPENDS([ZZ).[T.Bl): 

DEPEND8<[G|.tZZ.B|): 

PSI  :G  +  INTEGRATE(FF(Y,Z,B)  ,Y,O.T) ; 

OIFF(PSI,B): 

EVft.OIFF); 

t*  NOTE:  THE  LAST  COMMAND  GIVES  UNDESIRABLE  RESULTS. 
(SEE  REF.I ))  THE  PROBLEM  CAN  POSSIBLY  BE  HANDLED 
BY  GRAOEF  [71  COMMAND  7 
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/‘..THIS  IS  THE  FIRST  PROBLEM  OF  HAUG,  SEE  [71 
B  IS  THE  DESIGN  VARIABLE  AND  Z  IS  STATE  VARIABLE  . 
F  IS  THE  RIGHT  HAND  SIDE  OF  DIFFERENTIAL  EQUATION 
AND  H  IS  B.C  7 
H:MATRIX([O1,[021): 

B:MATRIX([Bll,[B2l): 

F;MATRIX((Z2(T)U*B1*Z1CT)1); 

Z:MATRIX([Z1(T)].[Z2(T)1); 

/*..SET  UP  OIFF  EQ  ANO  SOLVE  BY  LAP  TRANSFORM..*/ 

/* . 7 

EQ1:DIFF<Z,T)-F; 

r  INITIAL  VALUES  ARE  GIVEN  HERE  7  . 
ATVALUElZMTl.T-O.O); 

ATVALUE(Z2(T),T-O.02l; 

P1:DETERMINANT<EQ1111); 

P2:DETERMINANT(EQ1[2J): 

E02:LAPLACE(P1,T.S); 

EQ3:LAPLACE(P2,T,S): 

UN3OLVE(lE02>EQ3l.tLAPLACE(ZimiT,S)1 

LAPLACE(Z2(T),T,S)1); 

PP1:RHS(FIRST(%)); 

PP2:RHS(LAST(%TH(2))); 

/'  INVERSE  LAPLACE  TRANSFORM  V 
Zim:  =  ILT(PP1.S.T); 

Z2(T):-ILT(PP2,S,T): 

Z1(T); 

21(T):  -  "%; 
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/* . UNIT  VECTORS  ARE  B1  02  B3 . SEE  LEVINSON  V 

/’ . DEFINE  DOT  AND  CROSS  PRODUCTS..,,*/ 

DOT(V1.V2):"0LOCK([P,PPl, 

FOR  1:1  THRU  3  DO  Plll:RATCOeFFWl.B[ll)( 

FOR  1:1  THRU  3  DO  PP[ll:RATC0EFF(V2.B(l)). 
P141:SUM(P[I1*PP(IU,1,3), 

R6TURN(P[4)))S 

CROSStVI  tV2) ;  -  BLOCK«P,PP  .PPP), 

FOR  1:1  THRU  3  DO  P[ll:RATCOEFF<V1.B[l]>, 

FOR  1:1  THRU  3  DO  PP[11:RATC0EFF(V2,Blll), 

PPP[11:(P|2l*PP[3)-P[3rPP[2l)t 

PPP[21:(-P111*PP|31+P|3]‘PP111), 

PPP|31:(P[1I*PP[2J*P[2)*PP[1J). 

PPPt4l:B[11*PPP[lH-Bt2l*PPP[2j+Bl3l*PPPL3l. 

RETURN(PPP[4|))S 

/‘....NOW  WE  INPUT  EQUATIONS  FROM  LEVINSON’S  PAPER  V 
DEPENDS(U.T); 

DEPENDSIQJ); 

R[2l:COS(Q)*B(2]  +SIN(Q)*B[3l; 

R  [31  :-SlN(0)*B[2) + COS  (Q)  *B[3] ; 

WB:U[1)*B[l|  +  U[2]*B(2|+U|3l*B(3J: 

OERIVABBREV:TRUE; 

U[4l:DlFF(0.T): 

WR:(Ut1  J  +  U[41)*B[1|  +  U|2|*B(2J  FU[3l*0l3l; 
ALP0:OIFF(U(ll.T)*B[11+DlFF(U[2l.T)*B[2|+DIFF(U|3l,T)*Bl3l; 
ALPR:DIFF(WR.T)  +  CROSS(WB,WR); 

PPBS:B  1*B[1J  4-  B2*B[2|+03*B[3|; 
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PPRS:Rl'B[l]  +  n2*m2]+R3*Rl3J: 

PRSBS:PP9S-PPRS; 

VB8:U[5)*0t11  +  U[6l*Bt2l+U[7]*B[3l: 

VRS:VBS  +  DIFF(PRSBS.T) + CROSStWB.PRSBS) ; 

ABS;DIFF(VBS,T)+CnOSS(WB,VBS); 

ARS:DIFF(VRS.T)-t-CROSS(WB.VR8): 

IBBSWB:0ET1*Bl1]*DOT(Bl1],WB)+0ET2*8t2j*DOT<B[2).WB)+BET3*B[3]*DOT(B[3].WB); 
IRRSWR:RH0rBtl]*D0T(B[1](WR)+RH02*B[2|*D0T(B(2],Wfi)+RHO3*B[3|*DOT(B[3],WR>; 
IBBSALPB:BETl*B(l|*DOT(B[1],ALPB)  +  BET2*B(2l*DOT<B[2]tALPB) 

+  BET3*  B[3]  *DOT(B[3]  ,ALPB) ; 

IRRSALPR:RHOrBt11*DOTmm,ALPR)-l-RHO2*Br2],DOT(0[2!lALPn) 

H-RH03*B[3]*D0T(B(3I.ALPR): 

FSB:-M0*ABS: 

FSR:-MR*ARS; 

TSB:CROSS(IBB$WB,WB)-IBBSALPB; 

TSR:CR0SS(IRRSWR,WR)-1RR3ALPR: 

FB:F1*B[1]  +  F2*B[2HF3*B(3|; 

TB;T1‘BlU+T2*B[2l+T3*B(3]; 

F[R]:-DOT(DIFF(VBS,U[Rl),F6)+DOT(DIFF<WB,UtRl),TB); 
FS[R]:i-DOT(DIFF(VBS,U[R]),FSB)  +  DOT{DlFF(VnS,U[R]),FSR) 

+  DOT(DIFF(WB,UtR|)(T3B)  +  DOT(DIFF(WR,U[R]),TSR); 

EQ{R|:-F(R]+F3(R1; 

EQtll; 

FOR  1:1  THRU  7  DO  LD1SPLAY  (  X[1,|]:RATCOEFF(EQ[1],DIFF(UtlJ,Tl)J; 
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XP:X*B‘COS(TH); 

YP:Y-0*3IN(TH); 

XPP:X-A*COS(TH+GAM); 

YPP:Y-A*SIN(TH  +  QAM); 

EQ1:R*‘2-XP**2-YP“2; 

6Q2:S"2-(XPP-P)**2-YPP**2; 

EQ1:EXPAND(EQ1); 

EQ1:RATSUBST(1-SIN(TH)‘2,C0S(TH)"2,‘)«.); 

EQ1:EQ1/(2*B); 

EQ2:EXPAND(EQ2): 

EQ2:TRIGEXPAND(EQ2); 

EQ2:RATSUBST(1-COS(TH)'2tSIN(TH)‘2.EQ2>: 

EQ2:RATSUBST(1-COS(QAM)'2,SIN(GAM)‘21(Xi): 

EQ2:EQ2/(2*A); 

LL:RATCOEF(EQ2,-SIN(TH)); 

NN:RATCOEF(EQ2,COS(TH)): 

MM:RATC0EF(E01,SIN(TH)); 

PP:RATCOEF(EQVCOS(TH)); 

PHPH:E01-MM*SIN(TH)  +  PP*C0S(TH); 

PSIPSI:EQ2+LL*SIN(TH)-NNX03(TH); 

PSIPSI:RATSIMP(%): 

/*  APPENDIX  VIII  (CONT.)  V 

EQ11:-P*C0S(TH)  +  M*SIN(TH>-PH; 

EQ22:N*C0S{TH)-L*SIN(THl-PSI; 

SET:UN30LVE([E011,EQ22].[C0S(TH),SIN(TH)3); 

CTH:RHS(PART|EV(SET),1)); 

3TH:RHS(PART(EV(SET),2»: 

STH'2+CTH'2; 

%-1; 

XTHRU(%);  ' 

NUM(%); 
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r . SNTHESIS  BY  ANALYTIC  METHODS . */ 

X2:A1*COS(PH); 

Y2:A1*SIN(PH); 

X3:-A4  +  A3*COS(PSI); 

Y3;A3*S1N(PSI); 

F1:A2‘2-(X2-X3)’2-(Y2*Y3)"2; 

EXPAND  (%): 

F11:%; 

AA:RATCOEF(F11fSIN(PS1)); 

BB:RATCOEF(F11,COS(PSI)); 

F11-AA*SIN(PSI)-BB‘C0S(PSI): 

RATSUBST(t-SIN(PH)'2.COS(PH)'2.1fc): 

RATSUBST(1-SIN(PSI)*2,COS(PSl)'2,%|; 

T11:%; 

T11!T11/(2*A1*A3); 

/* . SOLUTION  BYDENAVIT  METHOD . 7 

F1:A*SIN(PSI)-fB*COS(PSI)-C; 
RATSUSST(2*TAN(X)/(l  +  TAN(X)"2),SIN(PSI),%); 


RATSUBBT((1-TAN{X)**2)/(1+TAN(X)**2),COS(PSI).%): 

RATSIMP(%): 

SOLVE  (%,TAN(X)): 

/* . SOLUTION  BY  HARTENBERG  AND  DENAVIT.,.,7 

FF(PH.PSl):  -  Kl*CO3(PH>-K2*C0S(PSI)  +  K3-COS(PH-PSI); 

/* . NOW  A  NUMERICAL  EXAMPLE . V 

/* . CH6BYCHEV  SPACING  IS  GIVEN  BY 

XK— A+H*CQ9(2K-tl*PI/2N 

WHERE  A-MEAN  H-HALF  THE  INTERVAL  OF  X . V 

KEEPFLOAT:TRUE: 

X(K):-3/2+1/2»COS((2,K-1)*%P!/e); 

X1:X(3).NUMER; 

X2:X(2); 

X3:X(1),NUMER; 

Y1:LOG(X1)/LOG(10),NUMER; 

Y2:LOG|X2)/LOG(10),NUMER; 

Y3:LOG(X3)/LOG(10>,NUMER; 

D6LPH:60/160*%PI: 

DELP8I:60/1B0*%P|; 

PH1:0; 

P8I1:0; 

YF:LOG(2)/LOG(lO),NUMER; 

PH2:(X2-X1)*DELPH,NUMER; 

PH3:(X3-X1)*06LPH,NUMER; 

PSl2:(Y2-Yl)/YF*DELPdl.NUMER; 

PSI3:(Y3-Y1)/YF*DELPSI.NUMER; 

A4:1; 

EQ1:FF(PH1,P9I1); 

EQ2:FF(PH2,PSI2): 

EQ3:FF(PH3,PSI3»: 

UNSOLVE((EQ1,EQ2,EQ3],tK1,K2,K3]),GLOBALSOLVE:TRUE; 

A3:A4/K1; 

A1:A4/K2; 

A2:SORT(A1**2+ A3**2+A4”2-2*A1*A3*K3); 
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/•..ALGEBRA  FOR  QUATERNIONS  FROM  YANG'S  PAPER..*/ 
NNPRED(N):-IS(N>-2); 

NNPRED(2); 

NNPRED(3); 

MATCHOECLARE(NN,NNPRED): 

TELLSIMPAFTER(EP"NN,0); 

AL12H:AL12+EP*A12; 

AL23H:AL23+EP*A23; 

AL34H :  AL34 + EP*  A34; 

AL4tH:AL41  +  EP*A41; 

TH1H:TH1+EP*S11; 

TH2H:TH2+EP*S2; 

TH3H:TH3+EP*33: 

TH4H:TH4-I-EP*S4: 

8AL12H:  EXPANQ(TAYLOR(SIN(AL12H),EP,0,1)); 

SAL23H:  EXPAND  (TAYLOR  (SIN(AL23H),EP,0, 03; 

SAL34H:  EXPAND(TAYLOR(aiN(AL34Hl,EP,0.1»; 

SAL41H:  EXPAND(TAYLOR(SIN(AL41H).EP.0.1»: 

3TH1H:  EXPAND(TAYLOR(SIN(TH1H).EP.0.1»; 

STH2H:  EXPAND(TAYL0R(SIN<TH2H|lEP10.1)); 

3TH3H:  EXPAND(TAYLOR(SIN(TH3H».EP(0,1)); 

STH4H:  EXPAND(TAYLOR(SIN(TH4HLEP.O,1)); 

CAL12H:EXPAND(TAYLOR(COS(AL12H|,EPA1»: 

CAL23H:EXPAND(TAYL0R(C03(AL23H),EP,0,1)); 

CAL34H:EXPANO(TAYLOR(COS(AL34H),EP,Ot1)): 

CAL4lH:EXPAND(TAYLOR(CO3<AL41H>,EP10,U): 

CTH1H:  EXPAND(TAYLOH(COS(TH1HI.EP.O,U); 

CTH2H:  6XPAND(TAYLOR(COS(TH2H).EP.O,1)): 

CTH3H:  EXPANDfrAYLORfCOSfTHSHhEP.O.I)); 

CTH4H:  EXPANDfT A YLOR (COS (TH4HI .EP.O. Ill; 
AATH1H:3AL12H*SAL34H*STH1H; 

BBTH1H:-SAL34H*(SAL41H,CAL12H+CAL41H*SAL12H*CTH1H); 
CCTH1H:CAL23H»CAL34H*(CAL41H*CAL12H-SAL41H*SAL12H*CTH1H3; 
EQ 1  :AATH  1  H*STH4H  +  BBTH 1  H*CTH4H-CCTH  1 H : 
PRIMARY:EV(EQ1,EP— 0); 

DUAL:RATCOEFF(EQ1,EP); 

A:RATCOEFF(PRIMARY,SIN(TH4»; 

B:RATC0EFF(PRIMARY.C03(TH4)J; 

C:EXPAND(PRlMARY-A*SlN(TH4)*B*COS(TH4|); 

DUAL  1 :  DUAL-S4*  (A‘COS  (TH4)-B*SIN(TH4)) ; 
A0:RATC0EFF(DUAL1,3IN(TH4)); 

BO:RATCOEFF(DUAL1.COS(TH4)>: 

CCO:EXPAND(DUAL1-AO*SIN(TH4)-BO*COS(TH4)); 

CC0:RATSIMP(CC0); 
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APPENDIX  XI 

/‘....THIS  PROGRAM  SETS  UP  THE  COMPLETE  MATRIX 

EQATION  FOR  ROBOT....FOR  DIFFERENTIAL  MOTION....*/ 
/•....modtlled  lor  Slanlwtl  manipulator....*/ 

/*CTH[l]:-COS(TH[IJ); 

STH[l]:— SINCTHIH);  V 
GCPRINT:FALSE: 

CALtl] :  -  COSlALlI]) ; 

SALMI; -SIN(AL[ll): 

AL[11:AL[4|:-%Pt/2; 

AL[2I:ALtS]:%PI/2; 

AL|3):AL[0]:O; 

AA[1]:AA[3);AA[4];AA[5|:AA[0];Q; 

AA[2J:0; 

D0(1):DDt4l:DD[5l:DD[6l;0; 

CTH  131:1; 

STHt3l:0; 

An]:-MATRIX((CTHll).-STHtl|*CAL[ll1STHfl|*SALtl),AA[l]*CTH[l]l, 

tSTHMI,CTHtlJ*CALtll.-CTHMI*SAL[l].AAtl)'STH[l]|, 

[0, SALMI, CALM, DDM11, 

[0,0.0,11); 

AMI; 

RATSUBST(l-(STH[1I)**2,(CTH(l]>**2,%); 

V‘-1; 

IA[1]:%$ 

RATSUBST(1-(3TH[11)**2.(CTH[1])"2,%); 

IA(11:%S 

A[2J: 

V-i: 

fiAT5UBST(1-(STH[2))**2.[CTH[2l)**2,%); 

IA12];%S 

A|3]; 

V-1; 

RATSUBST(1-<STH(3l)**2f(CTHl3l)**2.%); 

1Al3]:%£' 

A{4|; 

V*-1; 

RATSU0ST(1-(STH[4|)**2.[CTH[4])**2,%); 

1At4);%S 

A(si: 

V*l: 

RATSUBST(l-OTH[5l)**2,(CTH[6l)**2,%); 

IA[5]:%$ 

Ate]; 

%“*1; 

RATSUBST(1-(STH[0]r*2.(CTHte])"2,%); 

IA[0];%; 

T56:A[6]; 

T46:A[5).A(0I; 

T30:A[4|.A[6l.At6l: 

T20:A[3],A(4].A[5|.A[0): 

T10:A[2].A(3].A[4].A[5].A[0|; 

T6:A[1],A(2).A[3).A[4].A[51.A[01$ 

NX:T0[1,1]$ 

NY:T0[2,1]$ 

NZ:T6[3.1]$ 

OX:T6[l,2|$ 

OY:T6[2,2l$ 

OZ:T6[2,3]$ 

AX:T0tl.3l$ 

AY:TS[2,31$ 

AZ:T0(3,3l$ 

PX:T6M,4|$ 

PY:T0[2,4J$ 

PZ:T0[3,4J$ 

TT0iMATRIX<[NNX;OOX,AAX.PPX]. 

[NNV.OOY.AAY.PPY], 

[NNZ.OOZ.AAZ.PPZI. 

.  [0,0,0,11); 

IA1T6-.IA111.TT6; 

IA2T0:IA(2].%: 

IA3T6:fA[3).*Xi: 

IA4T6:IA[4).%S 

IA6T0:IA[5).%$ 

IA0T0:IA[8].%$ 

/‘....dllforentlal  relations.. .this  may  tie  used 
for  obtaining  the  sensitivity  analysis... we 
follow  the  agorlthm  provided  by  R.P.  Paul 
PAlGE  no.103....REVOLUTE-TDR  PRlSMATIC-TDP.V 

TpR(MAT):-BLOCK([NX,NY,NZ,PX,PY,PZ,OX.OY.OZ.AX.AY,AZ], 

NX:MAT[1,ll.NY:MAT[2,lJ,NZ:MAT[3,1|, 

OX;MAT[1,2l.OY:MATl2.2l,OZ:MAT[3,2], 

AX:MAT(1,3).AY:MAT[2.3].AZ:MAT[3.3l, 

PX:MAT[1,4),PY:MAT[2,4|,PZ:MAT(3.4|, 


TRANSPOSE(MATRlX«-NX*PY+NY*PX,-OX*PY  +  OY*PX.-AX*PY+AY*PX,NZ,OZ,AZM); 

TDP(MAT):-BLOCKUNX,NY.NZ,PX1PY.PZ1OX.OY.OZ.AX,AY,AZ], 

NX:MAT[1.ll,NY:MATl2,11,NZ:MAT[3.ll, 

OX:MAT[1,2l,OY:MAT[2f2],OZ:MAT[3,2l1 

AX:MATll,3l,AY:MATl2.3l.AZ:MATl3t3!, 

PX:MATtl,4l,PY:MATl2,4])PZ:MAT(3,4], 

TRANSPOSE(MATRIXUNZ,OZ,AZ,0,0,0l))); 

/*...,NOW  WE  SET  UP  COMPLETE  DIFFERENTIAL  MATRIX....*/ 

C0H1J:TDR(TB); 

COL[2);TDR(T10); 

COL[3|:TDP(T20); 

COLI4|:TDR(T30); 

COL(5l  :TDR  (T  46) : 

COU01:TDR(TS0); 

FOR  J:1  THRU  0  DO  (FOR  1:1  THRU  0  DO  (DIFFARRAY[I.J):COLtJ][l»); 

jacobian:genmatrix(diffarray.0,0): 

/•....NUMERICAL  EXAMPLE  PAGE  107  ROBERT  PAUL....*/ 

TH[1):TH[4]:0; 

THl2):TH|5):TH[0):%PI/2; 

TH13]:0; 

00(31:20; 

FOR  1:1  THRU  0  DO  (CTH[II:COS(TH[l|).STH[l]:SIN(TH(ll)): 

JAC:EV(JACOBIAN,NUMER); 

JAC:EV(%,NUMER); 

DQ:TRANSPOSE(MATRlk(l0.1, -0.1.2.0,0.1,0.1,0.11)): 

JAC.DQ; 
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/*  EXAMPLE  OF  ROBOT  CONSIDERD  BY  LUMELSKY  */ 

CTH(!|;-COS(TH[l|); 

STH  [1]  -  SIN  (THM1) ; 

GCPRINT:FALSE; 

CAL[ll:-COS<AL[l]): 

SALM) :  ■  SIN(ALtl)) ; 

AL[1):ALl2):AU4):%PI/2; 

ALl3l:AL[0l:O: 

AUSj:-%P1/2; 

A  A  [  1) :  A  A[3]  :AA[4) :  AA(5] ;  AA[S]  :0; 

AAI21:A2; 

DD|1]:DD[2]:DD[4]:0; 

CTH(3|:1: 

STH[3I:0: 

A[l):“MATRIX([CTH[l|.-STHM)*CALMl,STHm*SAL[ll,AAM)*CTH[l]l, 

[STH[|].CTH[l),CAL[IJ..CTHin*SALtll.AA[ll*STH[lll, 

tO.SALM)  ,C  ALfll.DD  [III ,  [0.0,0, 1 1 ) ; 

Alii; 

RATSUBST<1-STH[1)'2,CTH[tr2,%): 

A[1]:%$ 

RATSUBST(1-STHfir2.CTH[ir2,%): 

A[11:%S 

AI21; 

%“M): 

RATSUBSTt  l-STHt2r2.CTHl2r2.lt): 

Att):%S 

A[3): 

RATSUBST(1-STH[3]'2,CTH[3r2,%); 

A[3]:%£ 

A|4|; 

%'"M): 

RATSUBSTO  -STHMr  2.CTH  (41'  2.%) , 

A[41:%$ 

A  [61; 

R  ATSUBST  ( 1  -STH  1ST  2  ,CTHl5l"  2, It) ; 

A(6]:%S 

A(6I: 

R  ATSUBSTd  -STHter  2,CTHl©r  2.%) : 

A[0):%: 

T66:A|6|; 

T40:A[S]  .  A [01; 

T36.A14)  .  AI51  .  A[6l; 

T26:A[31  ,  A{4) .  A(Sl  .  Ate): 

T10:A(2l  .  A (3)  .  A[4l  .  At5l  .  A|6): 

T0:A|1|  .  A[2|  .  A [31  A[4)  ,  A(S|  .  A[e|S 
NX:T611,11: 

NY:T0[2.ll; 

NZ:Te[3,ll; 

OX:T0[1.2J: 


OY:T0[2.21: 

OZ:T6|Z,3): 

AX:T6tl,3); 

AY:T0[2,3J; 

AZ:T6[3,3l: 

PX:T6ll,4]; 

PY;T012,4]; 

PZ:T6t3,4|; 

TT0:MATfilX(lNNX1OOXtAAX.PPX],tNNY,OOY,AAY,PPY],[NNZ.OOZ,AAZ,PPZ|,[0, 0,0.1]): 
A1T6:IA(11  .  TT0; 

A2TB:IA12]  ,  %; 

A3TB: I A[31  .  %; 

A4T0:IA[41  .  %; 

A5T6:IA[S]  .  %; 

A0T6:!At0]  .  %;  ' 

EQ1:TT6-TB$ 

EQ2:IA1T6-Tie$ 

EQ3:IA2T0-T20$ 

6q4:IA3T0-T36$ 

EQ6:IA4T0-T40$ 

EO0:IA5T0-T50$ 

TABLE  (MAT, VAR):- BLOCKUEQJ, FOR  I  THRU  4  DO  (FOR  J  THRU  4  DO  EQ[l,J!:0), 
FOR  l  THRU  4  DO  (FOR  J  THRU  4  DO  (FOR  L  THRU  8  DO 
(IF  FREEOF(VAR[L],MATtUl)  -  FALSE  THEN  EQ[l,J]:EQtl,J]+T[U 
ELSE  FALSE))), GENMATRIX(EQ.4,4)); 

VAR[1]:TH[1); 

VAR(21:TH(2]; 

VAR(3|:DD(3]; 

VAR[41:TH{4]; 

VAR[6l:TH(5J: 

VARt0l:THt0]; 

TA0LE(EQ1.VAR): 

TABLE(EQ2,VAR); 

TABLE(EQ3,VAR); 

TABLE(EQ4.VAfl>; 

TABLE(EOS,VAR>; 

TABLE(EQ0,VAR); 


APPENDIX  XII  (CONT.) 

FOLLOWING  IS  A  PARTIAL  OUTPUT  FROM  ABOVE  PROGRAM: 
T1  T2  ETC  REPRESENTS  PRESENCE  OF  VARIABLE  1  2  ETC. 

TABLE(EQ2.VAR); 


T0  +  T5,f  T4  +  T2  +  T1 

T0  +  TS  +  T4  +  T2  +  T1 

COL  1  ” 

Te  +  TS+T.4  +  T2 

T0  ,  t5  +  t4  l  T1 

COL  2  — 

T0'(-T5  +  T4+T2 

VVt4iTi 

0 

0 

t5+t4+t2+t1 

T5  +  T4  +  T3  +  T2  +  T1 

COL  3  - 

ts+t4+t2 

ts+t4+t1 

COL  4- 

T5  +  T4  +  T3+T2 

0 

0 

TA0LE(EQ3,VAR); 


COL1  - 


T6  +  T5  +  T4  +  T2  +  T- 
Te+T5+T4+Ti 
To  +  Tc  +  Tn  +  T* 


COLS 


0 

'T5  +  T4  +  T2  +  Ti1 

T5  +  T4  +  T1 
ts+t2+ti 
o 


COL  2  - 


COL  4  ^ 


Te+Ts+T4+T2+Tr 
T6  +  T5  +  T4  +  T1 

VT5+T2  +  T1 
0 

TS  +  T4  +  T2  +  T1 
TS  +  T4  +  T1 
t6  +  T3  +  T2  +  T1 
0 


TABLE(EQ4,VAR); 


COL  1  - 


COL3  ■ 


|T6  +  TS  +  T4  +  T2  +  T1 
TQ+1'g+",’4+Ti 
Tq  +  Tr  +  Ta  +  T, 


T5  l'T4  '  T2't"T 
Tc  +  T.+T, 


Tc  +  T-j  +  T, 


COL  2  - 


COL  4- 


T0  +  T5  +  T4  +  T2  +  Tll 
T0  +  T5+T4+T1 
T0  +  T5  +  T2  +  T1 
0 

TS  *  T4  '  T2  »  T1 

Ts  +  T4+Ti 

T5  +  T3  +  T2  +  T1 
0 


TABLE(EQS,VAH): 


COL  1  - 


COL  3  - 


T^+Tg+T^  +  T^  +  Ti 
T6  +  TS  +  T2  +  T1 

T0  +  T4  +  t2  +  ti 
0 

[T=+Ta  +  T,  +  T 


Ts  +  ^+t, 

Ti  +  Tn  +  Ti 


COL  2  — 


COL  4  - 


Tg  +  Tg  +  T^+Tg-F^ 
Te+  Tg  +  Tg  +  T^ 
T0  +  T4  +  T2  +  T1 


Tg  +  T^Tg  +  T, 
T5  +  T3  +  T2  +  T1 
T4  +  T2+T1 
0 


TABLE(EQ0.VAR): 


COL1  - 


COLS  - 


fT6  +  T5  +  T4  +  T2 

+  T1 

T6  +  T5  +  T4  +  T2  +  T1 

T0  +  T4  +  T2  +  T1 

COL  2  - 

Wt2+ti 

Ts  +  T4  +  Tg  +  T1 

T5  +  T4  +  T2  +  T1 

0 

0 

Tg  +  ^+Tg  +  T, 

Tg  +  T4  +  Tg  +  T2  +  T1 

T4  +  Tg  +  T1 

COL  4- 

T4-FT2  +  T1 

Ts  +  T^Tg  +  T, 

TS  +  T4  +  t3  +  T2+T1 

0 

0 

A-5 

164 


Appendix  B 

SOME  REMARKS  ON  MACSYMA  COMMANDS 

We  assume  that  the  reader  is  familiar  with  the  introduction  given  in  the  MACSYMA 
Primer,  which  is  an  introduction  for  beginners  (see  Reference  [8]). 

We  will  use  EXPR  to  denote  any  symbolic  expression  such  as  X+  Y ,  SIN(Y),  etc. 

F;  EXPR; 

assigns  EXPR  to  the  variable  F .  (In  the  reference  manual,  F  would  be  called  an  atomic  vari¬ 
able.)  Note  that  F(X)  has  no  meaning  in  this  context.  However,  if  we  write 

F(X):«  EXPR 

we  are  now  defining  a  function  F(X).  For  example: 

F(X):=  SIN(X); 

F(Z) ;  (Machine  prints  SIN(Z)) 

This  can  also  be  achieved  by  the  LAMBDA  notation 
F:  LAMBDA ([X],SIN(X)); 

We  can  now  use  F(Z).  For  example: 

F(Z);  (Machine  prints  SIN(Z)) 

One  advantage  of  this  procedure  is  that  F  can  be  an  argument  to  another  function,  for  exam¬ 
ple  as  in  SIMPSON  in  the  text  (Section  6). 

An  abbreviated,  partial  list  of  the  commands  that  have  been  used  in  this  report  follows. 
For  details,  see  the  MACSYMA  Manual. 

DEPENDS([R,PURHO]);  =  R  and  P  are  functions  of  p. 

DEPENDENCIES(R(RHO));  =  R  -  R  (p),  as  in  the  last  command. 

DIFF(SIN(X),X);  =  dsin(x)/(ix 

EV(EXPR,X  =  0);  ^  Evaluate  EXPR  with  x=0.  EV  is  a  powerful  command  in 

MACSYMA  and  takes  multiple  arguments.  See  Manual. 

GRADEF(R,RHO,P/R);  =  sat  ~  -  PI R 

bp 

INTEGRATE(SIN(X),X);  s  J  sin(x)<& 

LDISPLAY  =  Display  with  equation  numbers. 

LlNSOLVEUEQLEQZl.tX.Yl);  =  Solve  set  of  linear  equations  £^  =  0,  Eq2=  0, 

for  x  and  y. 

GLOBALSOLVE:TRUE;  =  Assigns  the  values  to  the  variables  obtained  by 

LINSOLVE  command. 

MAP(FACTOR,EXPR1);  =  Factors  each  part  of  EXPR  I  separately. 

RATCOEFF(EXPR,X*l)  =  Obtain  the  coefficient  of  x1  in  EXPR . 
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RATCOEFFlEXPR.X.l) 


RATSIMP(EXPR); 

SUBST(0,X,EXPR); 


=  Same  as  above.  If  the  third  argument  is  not 
specified,  as  in  the  last  command,  it  is  taken 
as  1  by  default. 

=  Obtain  rational  simplification  of  expression  EXPR. 
=  Substitute  0  for  x  in  the  expression  EXPR . 


RATSUBST(0,X,EXPR1); 

SUM(P[l]XAl,l,0,M); 

TAYLORtSINM.XAS); 


=  Same  as  SUBST  after  expansion. 

M 

=  I  Pi*1 

i=0 

=  Obtain  Taylor  expansion  of  sinx  around  x  =  0  up  to 
the  fifth  power.  Can  also  be  used  for  multivariate 
functions. 


TRIGEXPAND(EXPRl);  =  Expand  the  trigonometric  functions  in  the 

expression  EXPR1. 

FOR  1:1  STEP  1  THRU  N  DO  (ANY  MACSYMA  COMMAND);  (Involving  set  of  7’s) 

=  This  is  a  DO  loop  for  1  to  n  in  steps  of  1.  Note 
that  the  default  for  starting  is  1  and  that  the  default 
step  is  1;  FOR  I  THRU  N  DO(...)  will  accomplish 
the  same  task. 
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ON  THE  CONVERSION  OF  UNPACK  TO  ADA 


Benjamin  J.  Martin 

Department  of  Mathematical  and  Computer  Sciences 
Atlanta  University 
Atlanta,  Georgia  30314 


Robert  Bozeman 
Department  of  Mathematics 
Morehouse  College 
Atlanta,  Georgia  30314 


ABSTRACT*  The  authors  demonstrate  the  feasibility  of  converting 
the  UNPACK  routines  for  analyzing  and  solving  systems  of  linear 
algebraic  equations  from  FORTRAN  to  Ada.  This  is  done  with  minimal 
alteration  of  the  original  program  structure,  thus  requiring  very 
little  re-orientation  by  current  users  of  UNPACK.  Sample  programs 
are  included. 


I.  INTRODUCTION.  In  a  paper  entitled  "Can  Ada  Replace  FORTRAN  for 
Numerical  Computation?",  Alfred  Morris,  Jr.,  of  Naval  Surface 
Weapons  Center  in  Dahlgren,  Virginia,  argues  that  Ada  is  an  inade¬ 
quate  substitute  for  FORTRAN.  He  presents  several  points  which  he 
considers  to  be  critical  deficiencies  of  Ada.  Among  these  "criti¬ 
cal  deficiencies"  is  the  failure  of  the  Ada  specifications  to 
include  the  internal  representation  of  arrays.  The  FORTRAN  stan¬ 
dard  requires  that  arrays  be  stored  in  column  major  form,  that  is, 
columns  are  stored  together  one  after  the  other.  This  standard 
along  with  the  absence  of  strong  typing  allows  the  programmer  to 
access  the  elements  of  a  matrix  as  if  it  were  a  vector  and  always 
get  the  right  element.  It  is  this  standard,  among  others,  that 
has  permitted  the  development  of  the  UNPACK  routines. 

Mr.  Morris  points  out  that  this  problem  leads  to  the  necessity 
of  abandoning  a  large  selection  of  fully  developed  algorithms  and 
routines.  Says  he,  "it  is  clear  that  a  considerable  portion  of  a 
quarter  of  century  accumulation  of  logic  and  code  could  not  be 
adapted  to  Ada.  This  would  include  software  packages  such  as 
Argonne  National  Laboratories '  UNPACK  ...  which  is  highly  refined 
and  quite  widely  used  on  a  variety  of  computers."  It  is  the  aim  of 
this  paper  to  demonstrate  the  feasibility  of  recoding  the  UNPACK 
routines  in  the  Ada  language. 
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IK _ METHODS  OF  CONVERSION.  There  are  three  approaches  to  the 

recoding  of  the  UNPACK  routines  which  seem  obvious  and  straightfor¬ 
ward.  The  first  approach  is  to  merely  insert  the  appropriate 
codes  for  the  various  BLAS  routines  in  the  various  places  where 
the  subroutine  calls  are  made.  This  would  reduce  the  work  required 
in  performing  the  conversion.  The  effect  of  this  insertion  on 
execution  time  should  be  minimal  since  no  extra  code  is  being 
executed.  In  fact,  the  overhead  of  the  subroutine  call  is  saved. 
However,  it  would  significantly  increase  the  space  requirements 
for  the  program  code.  It  would  also  destroy  the  modularity  and 
the  readibility  of  the  routines. 

The  second  approach  is  to  process  the  matrices  and  vectors 
prior  to  the  BLAS  subroutine  call  and  postprocess  them  after  the 
BLAS  subroutine  call.  The  preprocessor  would  remove  the  appropriate 
portion  of  the  column  of  the  matrix  or  the  appropriate  portion  of 
the  vector.  ■  This  approach  is  placed  back  in  its  place  in  the  matrix 
or  vector.  This  approach  maintains  the  readability  and  modularity 
of  the  program.  The  increase  in  space  requirements  is  minimal  in 
that  only  four  short  routines  are  needed  in  addition  to  the  usual 
ones.  The  increase  in  time  requirements  is  also  minimal  since  the 
only  thing  these  routines  do  is  transfer  several  data  items  back 
and  forth. 

The  third  approach  is  to  define  the  matrix  to  be  an  array  of 
vectors.  After  the  matrix  to  be  used  is  properly  defined  the 
appropriate  vector  is  sent  to  the  BLAS  subroutine.  In  order  to  use 
this  technique  the  vectors  to  be  transferred  must  be  the  columns  of 
the  matrix.  This  may  require  a  routine  to  compute  the  transpose  of 
the  matrix  before  proceeding.  Therefore,  the  increase  in  space 
requirements  and  in  execution  time  should  be  minimal.  The  Ada 
concept  of  slices  may  prove  useful  in  this  approach. 

III.  THE  IMPLEMENTATION.  The  first  alternative  was  dismissed  as 
being  the  least  attractive  alternative.  The  second  alternative 
appeared  to  be  the  easiest  to  implement  quickly,  and  so  was  consi¬ 
dered  first.  The  third  alternative  is  presently  being  pursued. 

In  order  to  implement  the  second  alternatives,  LINPACK  and  BLAS 
routines  for  a  general  system  were  coded  in  Ada.  The  changes  made 
in  the  programs  were  of  two  types.  The  first  type  of  changes 
simply  involved  the  use  of  structured  programming  technique  making 
use  of  control  structures  not  available  in  FORTRAN.  The  second 
type  of  change  involved  writing  four  new  routines  called  CONVRTM, 
RECONVRTM,  CONVRTV,  and  RECONVRTV.  The  first  two  routines  work  on 
matrices.  CONVRTM  takes  a  given  portion  of  a  column  from  a  matrix 
and  stores  it  in  a  vector.  RECONVRTM  takes  a  portion  of  a  column 
from  a  vector  and  replaces  it  in  a  matrix.  The  other  two  routines 
do  similar  things  for  a  vector.  The  routines  are  used  in  conjunc¬ 
tion  with  the  BLAS  routines  as  follows: 
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The  FORTRAN  statement  CALL  SAXPY(N-K,T,A, (K+l ,K) , 1 ,B(K+1) , 1 ) 
is  replaced  by  the  Ada  sequence  CONVRTM(N-K,K+l,K,A,X,l) ; 

C0NVRTV(N-K,K+1 ,8, Y) ; 

REC0NVRTM(N-K,K+1 ,K,A,X, 1) ; 

RECON VRTV(N-K,K+1,B,Y) ; 

Attached  is  the  program  which  resulted  from  this  implementation 
and  which  clearly  defines  the  action  of  each  of  these  routines. 
These  routines  were  coded  under  ADAED,  version  16.3.  They  may  or 
may  not  run  under  other  versions  of  ADAED.  Because  of  the  nature 
of  ADAED,  extensive  testing  is  not  possible.  A  simple  system  of 
five  equations  in  five  unknowns  took  30  minutes  of  CPU  time  to 
compile  and  execute. 

IV.  CONCLUSIONS.  While  it  may  not  be  possible  to  retain  all  of 
the  characteristics  of  the  LINPACK  routines  in  a  conversion  to  Ada, 
it  has  been  demonstrated  that  such  a  conversion  is  possible. 
Because  of  the  limitations  of  ADAED,  especially  its  execution  speed, 
any  real  testing  must  await  a  compiler.  Nevertheless,  it  is  clear 
that  the  routines  can  be  fairly  easily  recoded  in  Ada.  The  costs 
incurred  in  this  recoding  cannot  be  determined  at  this  time.  Other 
approaches  may  also  be  available  besides  the  ones  indicated  above. 
The  third  alternative  may  be  the  nest  of  the  three.  There  are 
indeed  unanswered  questions,  but  we  must  conclude  that  it  is  feasi¬ 
ble  to  salvage  at  least  a  portion  of  a  "quarter  century  accumula¬ 
tion  of  logic  and  code." 
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WITH  TPXT.TO; 

PROCEDURE  TEST  IS 
USE  TEXT.IO; 

TYPE  REAL  IS  NEW  FLOAT; 

PACKAGE  HIS. 10  IS  HEW  INTEGER. 10 (INTEGER) J 
PACKAGE  MYllO  IS  NEW  FLO AT. 10  (PEAL) ; 

USE  HlS.IQf  USE  MY-IO; 

SUBTYPE  INDEX  IS  INTEGER  RANGE  1..S? 

TYPE  INTVEC  IS  ARRAY  (INDEX)  OF  INDEX  * 

TYPE  VECTOR  IS  ARRAY  (INDEX)  OF  PEAL  t 
TYPE  MATRIX  IS  ARRAY  (INDEX , INDEX )  OF  REAL; 

A J MATRIX; 

Bt  VFCTOR; 

IPVT? INTVEC* 

INF". JQR: INTEGER J=0; 

SIZESTNDEXJS5; 

FUNCTION  ISAMAX(NsrNDEX?X;VECTOH)  RETURN  INDEX  IS 

mm 

—FINDS  THE  INDEX  OF  ELEMENT  HAVING  «AXImUm  ABSOLUTE  VALUE. 
—  ADAPTED  FROM  THE  3/ll/7tj  VERSION  BY  JACK  DONGARRA 

IMax:Tnpex:=1; 

xmax:real; 

BEGIN 

IF  N<1  THEN  RETURN  0;  END  IF; 

IF  Nsl  THEN  RETURN  1;  END  IF; 

xmax;=apscxci) ) ; 

FOR  I  IN  2 . . N  LOOP 

IF  ASSCXCI) )>XMAX  THEN 
IMAX:=i ; 

XMAX.sABS(X(t) ) ; 

END  if; 

END  LOOP; 

RETURN  IMAX; 

END  ISAMAX; 


FUNCTION  SAS'JM ( N  S  INDEX »  X  i  VFCTOR  )  RETURN  REAL  IS 

--TAKES  THF  SUM  OF  THE  ABSOLUTE  VALUES*. 

—  USES  UNROLLED  LOOPS. 

--ADAPTED  FROM  THE  3/11/78  VERSION  BY  JACK  DONGARRA 

m  m 

*2 i integer; 

M.M1I IMDFXJ 
SUV:PEALt=0 .0; 

BEGIN 

IF  N=0  THEN  RETURN  SUM  *  END  IF; 

M2J=N  MOD  6; 

IF  M2  /=  0  THEN 
m:-m2; 

FOR  I  TN  1..M  LOOP 

SUMjeSUM  +  ABSCXU)  )  ; 

fnd  loop; 

end  if; 

TF  N  >S  a  then 

TF  M2  =  0  THEN  M 1  :  =  I  ?  ELSE  M 1 ; + 1 ; END  IF; 

WHILE  mi<n  LOOP 

SUM:=SUMTAaSCXCMl) l+APS(XCMltl ))TABSCX(Ml+2) 3+ABSCXCM1+3) ) 
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♦  ABSfXtMH.4))+AbS(X(Ml+5)); 
M2s=Ml+6? 

IF  M2<N  THEN  MlisM?,  ELSE  MlS=N?  END  IF? 
END  LOOP} 

END  IF? 

RETURN  SUM  ? 

END  SASUM? 


PROCEDURE  SAXPYCN:rNDEX;S;REAL:X* VECTOR} Y:  IN  OUT  VECTOR)  IS 
^-COMPUTES  A  CONSTANT  TIMES  A  VECTOR  PLUS  A  VECTOR. 

7-USES  unrolled  LOOPS. 

--ADAPTED  FROM  THE  3/11/78  VERSION  BY  JACK  QONGARRA 

mm 

M ,Mli INDEX? 

M2 i INTEGER? 

BEGIN 

if  n  <=  o  then  return?  end  if? 

if  5=0.0  THEN  RETURN?  END  IF? 

W2:=N  MOD  4? 

TF  M2  /=  0  THEN 

m;=m2; 

FOR  I  IN  I..M  LOOP 
YEI) :=YtI)+S*X(I); 

END  LOOP? 

END  IF? 

IF  N  >s  4  THEN 

IF  M2=0  THEN  MlS=l?  ELSE  Mli=M+i;  END  IF? 

WHILE  Ml <N  LOUP 

YCMi)  ;=Y(Ml)+5*XCMn  ? 

Y  C  wl  +  l )  i=Y  (Mt-M)+S*X(M1+1)  ? 

Y(Mit2)  :=YC.Ml+2)^S*XfMl-*-2)  ? 

YCM1+3)  :=YCM1+3)+S*XCMW3j  ? 

M2:=Ml+4? 

TF  M2<n  Then  Ml;=M2;  else  M1J5M;  end  if? 

END  LOOP? 

END  TF? 

EXCEPTION 

,JHEN  CONSTRAINT_ERROR=> 

PUT  f "CONSTRAINT  ERROR  IN  SAXPY  N#  Ml.  M *  ")?  PUTCN)? 
PUT(Ml) ;PuTtM) : 

END  SAXPY? 


FUNCTION  sDOTCNi  TND£X;X.YjVL'CTOR)  RETURN  REAL  IS 

mm 

--FORMS  THE  DOT  PRODUCT  OF  TWO  VECTORS. 

7-USES  UMROLLED  LOOPS, 

--ADAPTED  FROM  THE  3/11/78  VERSION  BY  JACK  OONGARRA 

mm 

M2 -INTEGER? 

M.MU  INDEX? 

temp:rfal;=o,o? 

BEGIN 

IF  N<cO  THEN  RETURN  0.0?  END  IF? 

M2;  =  N  i-mo  5? 

if  m2  /=  o  then 

MS=M2; 

FOR  T  TN  1..M  LOOP 

TEMP:=TEMP^X(I)*Y£I3  • 
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END  TjQOP } 

EMD  if? 

IF  N>s5  THEN 

if  M2=o  then  mi?=i?  ELSE  MljsM+lsEND  if? 
while  M1<N  loop 

TEMPS sTEMP+X(M1 )*Y(Ml)? 

M2S3M1+5? 

IF  M2<N  THEN  M1;=M2?  ELSE  Ml! sN ;  END  IF; 
END  LOOP? 

END  IF? 
return  temp? 

END  SCOT? 


PRncFDtIRF  SSCALCN!  INDEX  ?  S I  REAL  j  X:  IN  OUT  VECTOR)  TS 

mm 

—SCALES  A  VECTOR  BY  A  CONSTANT. 

— USES  UNROLLED  LOOPS 

•-ADAPTED  FROM  THE  3/11/78  VERSION  BY  JACK  DPNGARRA 
M2 : INTFGFR  ? 

w.niundfx; 

BEGIN 

IF  N<*0  THEN  RETURN?  END  IF! 

M2:=N  MOD  5? 

IF  M2  /=  0  THEN 

FOR  I *  IN  t . . M  LOOP 
XCI) !=5*XCI)? 

END  LOOP? 

END  IF? 

IF  N>=5  THEN 

IF  M2=0  THEN  Mlsal?  ELSE  »li=M+l;FNO  IF? 

WHILE  Ml <N  LOOP 
XCM1) :=s*x(Mi) 1 
X ( M 1 + 1 ) !=S*XCMH-l) t 
XCM1>2) s=S*XCMtt2) f 
XCN1+3) iaS*X(Ml+3) t 
XC'*1+4):=S*XCMH'4)? 

M2:=Mi+5; 

IF  M2<N  THEN  Ml:=M2;  ELSE  MlS=N?  END  IF? 

END  LOOP; 
end  IF? 

RETURN? 

PND  SSCAL? 

w  * 

PROCEDURE  CONVRTMCN,K,L:INDEX:AjMATRlX;V:OUT  VECTOR?  INCjINDEX)  IS 
BEGIN 

IF  INC=1  THEN  —  PLACE  COLUMN  IN  VECTOR 

FOR  I  IN  1..N  LOOP 

VCI) :=A(K+T-1(L)? 

END  LOOP? 

FLSIF  TNOl  THEN  —  PLACE  ROW  IN  VECTOR 

FOR  I  IN  1..N  LOOP 

vci) :=A(K.l+i-1)7 
end  LOOP! 

end  tf? 
pnd  convrtm? 


PRnCFDURF  CONVRTV  t  M  ,  K  S I NDEX ; B  j VECTOR ; V !  OUT  VECTOR)  IS 
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BEGIN 


FOB  T  IN  1..N  LOOP 

V(I) laACKfl-t] t 
END  LOOP; 

ENn  CONVRTVJ 

PROCEDURE  RECONVRTMCN.K.LilNDEXiAjOUT  *ATRIX ? V : VECTOR ; INC i INDEX )  IS 
BEGIN 

IF  INC=1  THEN  —  PLACE  COLUMN  TN  VECTOR 

FOR  I  IN  1..M  LOOP 

ACK+I-1 ,L) isVCI) ; 

ENO  LOOP; 

ELSIF  TNOl  THEN  —  PLACE  ROW  IN  VECTOR 

FOR  I  IN  1..N  LOOP 

ACK.L+I-n  !  =  VCI) ; 

END  LOOPS 

end  if; 

END  RECONVRTM? 


PROCEDURE  RECONVRTV (N.KjINDEXrfliOUT  VECTOR t V I  VECTOR )  IS 
BEGIN 

FOR  I  IN  1,.N  LOOP 

bck-h-1  )  :=vci)  t  . 

END  LOOPS 

END  peconvrtv; 

PROCEDURE  SGESLCASIN  OUT  M ATP I X ; LDA , N : I NDEX J T PVT t IN  OUT  INTVEC; 

B: IN  OUT  VECTOR; JOBS  INTEGER)  IS 


SGESL  SOLVES  THE  REAL  SYSTEM  A  *  X  =  B 

OR  TRANSCA)  *  X  =  B 
USING  The  FACTORS  COMPUTED  BY  SGECO  OR  SGEFA. 

ON  ENTRY  * 

A  THE  OUTPUT  FROM  SGECO  OR  SGEFA* 

LDA  INDEX 

THE  LEADING  DIMENSION  OF  THE  ARRAY  A. 

N  INDEX 

THE  ORDER  OF  THE  MATRIX  A. 

IPVT  The  INDEX  PIVOT  VECTOR  FROM  SGECO  OR  SGEFA, 

R  THE  REAL  RIGHT  HAND  SIDE  VECTOR. 

JOB  INDEX 

=0  TO  SOLVE  A*X=B. 

<>0  TQ  SOLVE  TRANSCA) *X=B  WHERE  TRANS(A)  IS  THE  TRANSPOSE. 

ON  EXIT 

R  THE  SOLUTION  VECTOR  X, 

ERROR  CONDITION 

A  DIVISION  BY  7.ERO  WILL  OCCUR  IF  THE  INPUT  FACTOR  CONTAINS  A  ZERO  ON 
THE  DIAGONAL.  TECHNICALLY  THIS  INDICATES  SINGULARITY  BUT  IT  IS  OFTEN 
CAUSED  BY  IMPROPER  ARGUMENTS  OR  IMPROPER  SETTING  OF  LDA.  IT  WILL  NQt 
OCCUR  IF  THE  SUBROUTINES  APE  CALLED  CORRECTLY  AND  IF  SGECO  HAS  SET 
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--  RCHNOO.O  OR  5GEFA  HAS  SET  IMFO=0. 

«*  mt 

—  TO  COMPUTE  INVERSE(A)*C  WHERE  C  IS  A  MATRIX  WITH  P  COLUMNS 

—  SGECOCA ,LDA  f N,  U>VT.RCQMO. Z) ? 

7-  IF  crcond  NOT  IS  TOO  small)  then 

«  FOR  J  IN  1..P  LOOP 

?*  5GE5LCA4LDA.N,TPVT*CC1*J); 

—  END  LOOP; 

—  END  IF 1 

m  ■§ 

7-  LINpACK,  THIS  VERSION  IS  BASED  ON  THE  09/14/78  VERSION  BY 

7-  CLFVE  HOLER,  UNIVERSITY  OF  NEW  MEXICO,  ARGONNE  NATIONAL  LAB. 

m  « 

7-  PROCEDURES  AND  FUNCTIONS!  BL AS ,  SAXPY,  SQOT 

«■  mt 

k,kb,l.nmuindex;  --internal  variables 
T:  real; 

X , Y: VECTOR  ? 

BEGIN 

NM1J=N-Iy 

IF  JUR-0  THEN  --  SOLVE  A*X  =  B 

IF  NM1>0  THEN  —  FIRST  SOLVE  L*Y=B 

FOR  K  IN  1..NM1  LOOP 
L:*IPVT(K); 

TS*B<L): 

IF  L/=K  THEN 

BfL) ;=BfK) ; 

B(K) !=T? 

END  IF  f 

CONVRTM (N-K ,K+l,K,AfXfl)? 

CONVRTV (N-K , K+l , R,Y) ? 

SAXPYCN-K,T,X.Y) ? 

RECONVRTM(H-K,K+t .K, A,X, I ) ? 

RECONVRTVCN-K.K+l .B.Y) J 
END  LOOP? 

END  IF? 

For  kb  in  1..N-1  LOOP  —  NOW  SOLVE  U*X  =  Y 

k:=n+1-KB; 

BCK) !=BCK)/A(K,K) ; 

t:=-bck) ? 

CONVRTM<K-l,l,K,A.X.i); 

CONVRTVCK-l,l,B,Y); 

SAXPYCK-l,T,XfY) i 

RECONVRTMCK-l,l,K(A,X.n? 

RECONVRTVCK-1, l jri.Y) ; 

END  LOOP? 

R(n?=BCl)/A(l,l)j  --THIS  AVOIDS  A  CALL  TO  SAXPY  WITH  K-1*0 
ELSE  —  JQBOO.  SOLVE  TRANS(A)*X=B 

BC1):=BU)/A(l,l);  —  THIS  AVOIDS  A  CALL  T'O  SDOT  WITH  K-1  =  0 

FOR  K  IN  2..N  LOOP  --  FIRST  SOLVE  TRAN5CU)*Y=B 
CONVRTMCK-itl.K, AfX,l) ? 

CONVRTV  CK-1 # 1 ,B  #  Y ) ; 

T:=SDbTCK-l,X,B)! 

B(K):=CBCK)-T)/ACK.K)? 

END  LOOP; 

IF  NM1>0  THEN  --  NOW  SOLVE  TRANSCLJ*X=Y 

FOR  KB  IN  1  • « NM 1  LOOP 
K*  r N-KB * 

CUNVRTMCN-K,K+i .K.A.X.l) ? 

CONVRTVCN-K,K+i ,R,Y) ; 

BCK) !=B(K)+5D0TCN-K,X,Y) ? 
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L:  =  IPVT(K)  ■ 

IF  L/sK  THEN 
T  J  sB (L )  • 

B(L)  ;sb(K) ; 

0CK):=T; 

END  IF? 

END  LOOP? 

END  IF» 

END  IFJ 
END  SGESL; 

PROCEDURE  SGEFACAsIN  OUT  MATRIX ;  Ll)A  ,  N  :  INDEX ; I PVT: IN  OUT  INTVEC; 

INFO’:  OUT  INTEGER)  IS 

mm 

--  SGEFA  FACTORS  A  PEAL  MATRIX  RY  GAUSSIAN  ELIMINATION 

mm 

—  SGEFA  IS  USUALLY  CALLED  BY  SGECO.  BUT  IT  CAN  BE  CALLED  DIRECTLY  WITH 

—  A  SAVING  IN  TIME  IF  RCOND  is  NOT  NEEDED. 

7-  (TIME  FOR  SEGCO)  =  C 1  +  9/M)*(TIME  FOR  SGEFA). 

mm 

-•  ON  ENTRY 

«  m 

A  the  real  matrix  TU  be  factored. 

--  LDA  INDEX 

r-  THE  LEADING  DIMENSION  OF  THE  ARRAY  A, 

mm 

--  N  INDEX 

The  order  OF  THE  MATRIX  A. 

7-  ON  RETURN 

mm 

—  A  AN  UPPER  TRIANGULAR  MATRIX  AND  THE  MULTIPLIERS  WHICH  WERE  USED 

—  TO  OBTAIN  IT.  THE  FACTORIZATION  CAN  BE  WRITTEN  A  =  L*U  WHERE 

—  L  TS  A  PRODUCT  OF  PERMDTATINS  AND  UNIT  LOWER  TRIANGULAR  HATRICH 

—  AND  U  15  UPPER  TRIANGULAR, 

m  m 

«  IPVT  AN  INDEX  VECTOR  OF  PIVOT  INOTCES. 

—  INFO  INTEGER 

=0  NORMAL  VALUE. 

--  =K  IF  UCK.K)  =  0.  THIS  IS  NOT  AN  ERROR  CONDITION  FOR  THIS 

—  SUBROUTINE.  BUT  IT  DOES  INDICATE  THAT  5GESL  OR  SGEOI  WILL 
DIVIDE  BY  ZERO  IF  CALLED.  USE  RCQND  IN  SGECO  FOR  A  RELIABli 

—  INDICATION  OF  SINGULARITY. 

m  m 

LINPACK,  THIS  VERSION  BASED  ON  THE  08/14/78  VERSION  BY 

—  CLEVE'MQLER,  UNIVERSITY  OF  NEW  MEXICO.  ARGONNE  NATIONAL  LAB. 

—  SUBROUTINES  AND  FUNCTIONS:  BLAS.  SAXPY.  SSCAL,  ISAHAX 

mm 

J.N.Kpl.L.WHl# INDEX  *  —  INTERNAL  VARIABLES 

TlPEAL; 

X.YSVECTOR: 

BEGIN 

GAUSSIAN  ELIMINATION  WITH  PARTIAL  PIVOTING 
INFOSsO; 

mmi :=n-i : 

IF  N M 1 > 0  THEN 

FOR  K  IN  1..NM1,  LOOP 
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KP1ISK+U 


?-  PULL  OUT  THE  APPROPRIATE  COLUMN  VECTOR  FOR  ISAMAX 

CONVRTMCM-K+l .K.K.A.X, 1) t 

LJSISAMAXCN-K+i .XJ+K-ll  —  FIND  L  =  PIVOT  INDEX 

IPVT(K):*LJ  —  ZERO  PIVOT  IMPLIES  THIS  COLUMN  ALREADY 

TF  ACL.KlsO.OEO  THEN  INFO :=INTEGER C K 1 ;  —  TRIANGULA^ 

ELSE 

IF  L/sK  THEN  —  INTERCHANGE  IF  NECESSARY 

T;sA(L,K); 

ACL.K):=ACK.K)> 

ACK,K):=TJ 
END  IF t 

T;=-t.0E0/ACK,K) ;  —  COMPUTE  MULTIPLIERS 

CONVRTM(N-K,K  +  l,K, A.X.l)  t 
S SCALCN^K.T.XI t 
R£CONVRTMCN-K,K  +  l ,K.  A.X.l)  J 

FOR  J  IN  KPI..N  LOOP  —  RQW  ELIMINATION  WITH 

T:=ACL.J)J  —  COLUMN  INDEXING 

IF  L/=K  THEN 

:=A(K, J)  ; 

A(K.J) !=T ; 

END  IF; 

CDNVRTM(H^K,K^1 ,K.  A.X.l) t 
CONVPTMCN-K.K+1 .J.A.Y.1H 
SAXPYCN-K.T.X.Y) ? 

RECON VRTM (N-K. K  +  l  ,K, A.X.l) 7 
RECONVRTMCN-K.K+l. J.A.Y, 1) i 
END  LOOP i 
END  IF; 

END  LOOP.* 

END  if; 

tpvtCn):sn; 

IF  A(N*N)S0.0E0  then  INFORM;  END  IF? 

END  SGFFAJ 
BEGIN 

PUT.LINEC’TYPE  IN  THE  MATRIX  A"); 

FOR  I  IN  INDEX  LOOP 

FOR  J  IN  INDEX  LOOP 

getcaci, j) ) ; 

END  LOUP; 

GET(BCI)) t 
END  LOOP; 

FOR  I  IN  INDEX  LOOP 
FOR  J  IN  INDEX  LOOP 

PUT  (ACl.Jl);  PUT  C "  '' )  r 

END' LOOP; 
new.line; 

PHTCBCI) ) ; 

NEW-LINE; 

END  loop; 

SGEFACA.STZF.SiZE.lPVT.iNFOl ; 

SGESLCA.STZE.SiZE.IPVT.D.JOP) t 
FOR  I  JN  INDEX  LOOP 

PUTCaCin  ;p»'tcm  "); 
end. loop; 

NFW-LINF; 

END  TEST; 
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AUTOMATIC  GENERATION  OF  TAYLOR  SERIES  IN  PASCAL-SC: 
BASIC  APPLICATIONS  TO  ORDINARY  DIFFERENTIAL  EQUATIONS 
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Marquette  University,  Milwaukee  WI  53233 

and 
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ABSTRACT .  Taylor  series  have  a  long  history  of  usefulness  in  numerical 
analysis,  especially  for  the  numerical  solution  of  the  initial-value  problem  for 
systems  of  ordinary  differential  equations.  Recurrence  relations  for 
coefficients  of  Taylor  series  are  well-known,  for  arithmetic  operations  and 
various  standard  functions  with  series  arguments.  Compilers  for  languages  such 
as  Pascal-SC,  Algol  68,  and  Ada"1  (a  trademark  of  the  U.  S.  Department  of 
Defense)  have  built-in  facilities  for  the  support  of  user-defined  data  types  and 
operators  which  allow  automatic  generation  of  machine  code  to  evaluate  Taylor 
coefficients.  In  addition,  Pascal-SC  (Pascal  for  Scientific  Computation)  offers 
accurate  floating-point  and  interval  arithmetic  for  numerical  calculations .  In 
this  language,  series  with  real  (interval)  coefficients  are  introduced  as  type 
TAYLOR  (ITAYLOR).  The  operators  +,  -,  *,  /,  **  and  the  functions  SQR,  SQRT, 

EXP,  SIN,  COS,  ARCTAN,  and  LN  are  implemented  for  arguments  of  types  TAYLOR, 
ITAYLOR,  INTEGER,  REAL,  and  INTERVAL.  An  initial-value  problem  for  an  ordinary 
differential  equation  is  solved  using  types  TAYLOR  and  ITAYLOR.  A  stability 
analysis  shows  that  the  recurrence  relations  for  the  series  generation  exhibit  a 
mild  instability  which  has  no  significant  effect  on  the  values  of  the  solution 
computed  by  analytic  continuation. 

AMS  (MOS)  Subject  Classifications:  65-04,  65G10,  65LQ5,  65L07,  65V05 

Key  Words  and  Phrases:  Taylor  series,  recurrence  relations  for  Taylor 
coefficients,  automatic  differentiation,  numerical  solution  of  ordinary 
differential  equations,  stability,  error  analysis,  interval  arithmetic 

1 .  TAYLOR  SERIES ,  POLYNOMIALS ,  AND  FORMS .  A  fundamental  tool  of  numerical 
analysis  is  the  expansion  of  a  real  function  f  of  a  real  variable  x  into  a 
Taylor  series  at  x  =  xQ,  which  gives  the  expression 


(1.1)  f(x)  -  I  fU  ’’(x  )(x  -  X  )(l_1)/(i-1)!, 

i  =  1 

valid  for  |x  -  xQ|  <  p,  where  p  is  the  radius  of  convergence  of  the  infinite 
series  on  the  right-hand  side  of  (1.1).  Of  course,  in  actual  numerical 
computation,  the  Taylor  polynomial 
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(1.2)  f  (x)  =  I  f (i_1)  (x„)(x  -  xn)(i_1)/(i-l)  l, 

n  i=1  00 

is  used  in  place  of  the  infinite  series.  This  results  in  the  truncation  error 

(1*3)  ( f , x ; x q )  -  f(x)  -  fn(x)  =  f(n)(C)(x  -  x0)n/nl,  5  6  X, 

where  X  denotes  the  interval  X  =  [min{ x,xQ} ,max{  x,xQ} ] ,  and  the  remainder  term 
R  (f,Xg?x)  is  expressed  in  Lagrange  form.  This  approximation  of  f(x)  by  fn(x) 
gives  rise  to  a  problem  of  error  estimation  which  can  be  solved  by  the  methods 
of  interval  analysis.  If  F^n^  is, an  interval  inclusion  of  the  real  function 
f(n),  then 

(1.4)  f(x)  -  fn(x)  =  Rn(f,x0;x)  e  F(n,(X)(x  -  x0)n/nl? 

this  allows  automatic  computation  of  guaranteed  error  bounds  by  the  use  of 
interval  arithmetic  [13],  [14]. 

In  order  for  Taylor  series  methods  to  be  useful  in  scientific 
computation,  it  must  be  possible  to  automate  the  calculation  of  the  normalized 
real  Taylor  coefficients 

(1.5)  c(i  +  1)  =  f(i)(x0)(x  -  Xq)1/!!,  i  =  1, . . . ,n-1, 
and  the  corresponding  interval  quantities 

(1.6)  C(i  +  1)  *  F(l)(X)(x  -  xQ)Vi!,  i  =  1,...,n-1. 

These  calculations  can  be  carried  out  by  means  of  well-known  recurrence 
relations  [1],  [13],  [14],  [17]  for  functions  defined  by  subroutines  or 
expressions  involving  arithmetic  operations  and  a  variety  of  standard  functions 
for  which  library  subroutines  are  available.  A  very  important  application  of 
automated  generation  of  Taylor  series  by  recursion  is  the  numerical  solution  of 
the  initial-value  problem  for  ordinary  differential  equations-  That  is,  it  is 
required  to  find  y  =  y(x)  =  (y 1 (x) ,y2(x) , . . . ,ym(x) )  such  that 

(1.7)  yi  “  ft(x»y)'  yi(xo)=Yio'  1  “ 

for  values  of  x  in  an  interval  containing  Xq  [1],  [3],  [6],  [13],  [14]. 

Another  application  of  the  methods  in  this  paper  is  to  the  automatic 
generation  of  interval  inclusions  of  real  functions  by  means  of  their  interval 
mean-value  and  Taylor  forms  [13],  [14],  [20].  Suppose,  for  example,  that  f(x) 
is  a  real  function,  such  as 

(1.8)  f(x)  =  (x  +  3)/(x2  +  2), 
which  can  be  evaluated  by  the  corresponding  egression 

(1.9)  f  :=  (x  +  3 )/(x**2  +  2); 

in  a  Pascal-SC  program.  An  interval  inclusion  F  of  f  on  an  interval  X,  for 
which 

(1.10)  f (X)  =  { £ ( x )  I  x  e  X}  C  F(X) 


178 


can  be  obtained  simply  by  declaring  the  variables  F  and  X  to  be  of  type 
INTERVAL,  and  then  evaluating  the  expression  corresponding  to  (1.9), 

(1.11)  F  :=  (X  +  3)/(X**2  +  2) 

using  interval  arithmetic,  a  standard  feature  of  Pascal-SC  [24] .  An  inclusion 
obtained  in  this  way  may  be  too  coarse  in  the  sense  that  F(X)  is  a  much  larger 
interval  than  needed  to  contain  f(X).  In  this  case,  an  interval  inclusion 
provided  by  the  mean-value  form 

(1-12)  F ^ (x)  =  f ( x )  +  F* (X) (X  -  X),  X  e  x, 

can  be  better,  particularly  if  the  width  of  X  is  not  large  [13],  [14],  [20].  in 

(1.12) ,  F'  denotes  an  interval  inclusion  of  the  derivative  f'  of  f;  F' (X)  is 
obtained  automatically  by  evaluating  (1.11)  with  F  and  X  of  type  ITAYLOR,  as 
will  be  explained  below.  Interval  inclusions  of  f  are  also  provided  by  Taylor 
forms  of  higher  order  [20] ,  in  general, 

(1.13)  F  (X)  =  Z  f(i)(x)(X  -  x)Vi!  +  F(n)(X)(X  -  x )  n/n ! ,  x  e  X. 

n  ,  * 


These  forms  can  be  generated  automatically  from  the  expressions  (1.9)  and  (1.11) 

by  the  use  of  types  TAYLOR  and  ITAYLOR,  respectively.  Recursive  generation  of 

real  and  interval  Taylor  coefficients  makes  possible  an  adaptive  method  for 

calculation  of  interval  inclusions  of  real  functions,  in  which  n  is  increased 

until  F  (X)  includes  F  .(X).  It  is  also  possible  to  reduce  the  width  of 
n  n—  i 

computed  inclusions  by  making  use"  of  the  fact  that  the  intersection  of  interval 
inclusions  is  likewise  an  interval  inclusion. 

Previous  implementations  of  automatic  generation  of  Taylor  coefficients 
in  computer  languages  such  as  FORTRAN  have  used  interpretation  [21]  or  pre¬ 
compilation  [11]  to  activate  the  necessary  subroutines  [17],  In  more  modern 
languages,  the  compiler  itself  can  be  used  to  produce  the  necessary  routines, 
leading  to  a  saving  of  programming  effort  and  an  increase  in  clarity  of  the 
source  code.  The  use  of  Pascal-SC,  a  language  of  this  type,  will  be  explained 
in  the  next  section. 

2.  PASCAL-SC.  The  method  for  automatic  generation  of  Taylor  series  given  in 
this  report  is  based  on  computation  with  the  coefficients  of  Taylor  polynomials 
of  arbitrary  length,  considered  as  specific  mathematical  entities.  This 
requires  that  the  language  support  i)  user  defined  data  types,  as  do  descendents 
of  ALGOL- 60  such  as  Pascal  and  ADA™  (ADA  is  a  trademark  of  the  U.S*  Department 
of  Defense)?  and  ii)  user  defined  operators,  as  do  ALGOL-68  and  ADA. 

Pascal-SC  [2]  is  an  extension  of  Pascal  which  provides  both  user-defined 
data  types  and  user-defined  operators.  This  paper  assumes  a  modest  familiarity 
with  standard  Pascal  [9] .  For  the  remainder  of  this  Section,  we  outline  some  of 
the  extensions  which  make  Pascal-SC  well  suited  to  the  applications  in  this 
paper.  The  reader  who  wishes  to  omit  the  discussion  of  programming  language 
issues  may  proceed  directly  ^to  the  definition  of  the  data  types  TAYLOR  and 
ITAYLOR  in  Section  3. 

Pascal-SC  was  developed  with  the  needs  of  scientific  computation  in  mind. 

It  is  an  implementation  of  Jensen  and  Wirth  Pascal  [9]  which  also  provides 
intervals,  complex  numbers,  complex  intervals,  as  built-in  elementary  scalar 


179 


data  types  [24] .  A  full  range  of  standard  operators  is  provided  to  manipulate 
the  elementary  scalar  data  types,  as  well  as  vectors  and  matrices  built  of  these 
types  [24] ♦ 

Standard  Pascal  supports  user-defined  data  types  built  from  elementary  data 
types.  This  feature  will  be  used  to  define  variables  of  type  TAYLOR  and  ITAYLOR 
(interval  Taylor)  in  Section  3. 

Pascal-SC  allows  the  user  to  define  operators.  Most  conputer  languages 
allow  programmers  to  define  functions,  subroutines,  or  procedures,  but  except 
for  APL,  the  languages  most  often  used  for  scientific  computation  require  that 
such  user-defined  functions  be  called  using  a  prefix  notation  (eg.  SIN  (X)), 
while  built-in  operators  are  called  using  an  infix  notation  (eg.  A  +  B). 
Programmers  can  define  operators  in  Pascal-SC  to  extend  the  language  in  a 
uniform  way,  retaining  the  familiar  infix  notation  for  operators  whose  operands 
are  variables  of  user -defined  types  (eg.  A  +  B,  where  A  and  B  are  variables  of 
type  TAYLOR) . 

Operators,  functions,  and  procedures  in  Pascal-SC  can  be  overloaded.  That 
is,  the  name  of  an  operator,  a  function,  or  a  procedure  can  have  different 
meanings,  depending  on  the  type  or  number  of  its  operands.  For  exanple,  the 
standard  Pascal  or  FORTRAN  operator  "+"  is  said  to  be  overloaded  because  "A  +  b" 
for  integer  variables  A  and  B  has  a  different  meaning  from  "A  +  B"  for  real 
variables  A  and  B.  The  support  of  Pascal-SC  for  overloading  of  user  defined 
operators  is  essential  to  the  uniform  extension  of  the  language  because  we  wish 
to  define  the  meaning  of  "A  +  B"  for  variables  which  represent  Taylor  series 
with  real  or  with  interval  coefficients. 

The  support  of  Pascal-SC  for  user-defined  operators  and  for  overloading  is 
very  similar  to  that  provided  by  ADA.  ADA's  PACKAGE  concept  would  allow  a  more 
secure  iir?>lementation  of  data  abstractions  [10]  for  real  and  interval  valued 
Taylor  series.  The  operations  on  intervals,  however,  also  require  support  for 
directed  rounding  of  floating-point  results  in  order  to  guarantee  that  the 
desired  answer  is  contained  in  the  interval  computed.  The  early  implementations 
of  ADA  do  not  provide  an  accuracy  of  floating-point  computations  which  can 
compete  with  Pascal-SC. 

Pascal-SC  features  a  highly  accurate  arithmetic  based  on  a  general  theory 
[12]  for  real  and  complex  numbers,  real  and  complex  intervals,  and  vectors  and 
matrices  over  these  types.  Operations  on  floating-point  numbers  are  rounded  to 
the  closest  floating-point  number  to  the  true  result,  or  upward  or  downward  to 
the  closest  neighboring  floating-point  number  under  the  control  of  the  user. 

This  accuracy  meets  the  proposed  IEEE  standard  for  floating-point  arithmetic 
[15].  In  addition,  scalar  products  of  vectors 

N 

(2.1)  SC ALP ( A, B, ROUND)  *  l  A.*B. 

i=1  1  1 


are  calculated  with  the  same  accuracy  (to  the  closest  floating-point  number), 
and  with  the  same  options  for  rounding  [24] .  A  sufficiently  long  accumulator  is 
used  to  store  intermediate  results  in  the  evaluation  of  the  scalar  (or  inner) 
product.  This  capability  can  also  be  used  to  obtain  results  of  the  same  high 
accuracy  in  evaluation  of  a  given  arithmetic  expression  so  that  I.Oe+99  +  1.0E- 
99  -  I.OE+99  yields  1.0E-99. 
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3,  TYPES  TAYLOR  AND  ITAYLOR.  We  wish  to  provide  the  developer  of  scientific 
software  with  a  set  of  tools  with  which  Taylor  series  methods  can  be  implemented 
easily  for  a  variety  of  numerical  problems*  The  ability  of  the  conputer  to 
perform  formula  translation  is  used.  Compilers  since  the  first  FORTRAN  compiler 
have  produced  machine  code  for  the  evaluation  of  an  expression  such  as 

(3.1)  F  :=  (X*Y  +  SIN(X)  +  4.0)  *  (3,0  *  (Y**2)  +  6.0). 

This  is  done  by  analysis  of  the  expression  and  application  of  the  rules  for 
evaluation  of  formulas.  If  the  rules  for  differentiation  or  recursive 
generation  of  Taylor  coefficients  are  applied  in  the  same  way,  then  code  for  the 
evaluation  of  the  corresponding  quantities  results  [17].  In  this  way,  fast  and 
inexpensive  operations  performed  by  the  compiler  avoid  the  overhead  involved  in 
invoking  symbolic  differentiation  software.  This  leads  to  a  more  efficient 
implementation  of  Taylor  series  generation  all  the  way  from  initial  coding 
through  program  execution. 

The  normalized  Taylor  coefficients  of  a  function  f(x)  expanded  at  x  =  xQ 
are  defined  by 

(3.2)  f  •  TC  [K  +  1]  =  f (K) (xQ)tK/K! ,  t  =  x  -  xQ,  X  =  0,1,2,...  . 

Then 

(3.3) 


where  DIM  is  the  length  of  the  truncated  series  which  is  actually  stored.  This 
real  or  interval  vector  of  normalized  Taylor  coefficients  is  the  basis  for  the 
data  types  TAYLOR  and  ITAYLOF.  For  the  remainder  of  this  paper,  the  term 
"series"  is  used  to  refer  to  the  Taylor  polynomial  given  by  equation  (3.3)  or 
its  interval  analog. 

In  what  follows,  the  general  rule  will  be  adopted  that  all  variables  or 
expressions  of  the  scalar  types  INTEGER,  REAL,  or  INTERVAL  are  treated  as 
constants  for  the  purposes  of  differentiation. 

To  form  the  real  data  type  TAYLOR,  the  DIM  normalized  Taylor 
coefficients  in  (3.3)  are  stored  as  a  vector  of  floating-point  numbers.  The 
appropriate  declarations  in  Pascal-SC  are: 

CONST  DIM  -  n?  {  User  supplies  n  } 

TYPE  DIMTYPE  =  1..DIM; 

(3.4)  KVECTOR  =  ARRAY [DIMTYPE]  OF  REAL; 

TAYLOR  =  RECORD  LENGTH  :  DIMTYPF ; 

T  :  REAL; 

TC  :  P VECTOR  END; 

These  declarations  are  the  same  as  those  given  in  [19],  except  for  the  field 
named  LENGTH.  Let  F  be  a  variable  of  type  TAYLOR  (declared  by:  VAR  F: 

TAYLOR),  then  F, LENGTH  denotes  the  actual  length  of  the  truncated  series  (1  <_ 

F. LENGTH  <_  DIM).  It  may  happen  that  F. LENGTH  <  DIM  if  F  is  being  built  up 
recursively,  if  F  has  been  defined  by  term-by-term  differentiation  of  another 
series,  or  if  F  has  been  defined  as  a  quotient  of  two  series  both  of  whose 
leading  terms  are  zero  (see  Section  4.2),  This  field  has  been  added  to  the 
record  for  type  TAYLOR  given  in  [19]  for  internal  documentation  and  so  that 


DIM 

fDIM(X>  =  Z  f>TC[K1' 


181 


* 

i)  only  series  terms  actually  used  need  to  be  processed;  and  ii)  1' Hospital's 
rule  can  be  applied  to  certain  indeterminant  fbrms  0/0  which  may  appear. 

The  normalized  Taylor  coefficients  themselves  are  stored  in  the  array  of 
real  numbers  named  TC,  that  is, 

(3.5)  F.TC[K]  =  F(K"1)(X0)(X  -  X0)(K-1)/Ki,  K=1,...,DIM. 

The  size  of  the  step  being  used  for  expansion  is  F*T  =  X  -  XQ,  Series  are 
generated  using  a  fixed  stepsize  for  which  the  series  might  even  be  divergent. 
The  series  for  F  at  a  different  point  Z  is  readily  computed  at  a  cost 
proportional  to  DIM: 

(3.6)  F.TC [K]  :=  F.TC[K]*((Z  -  XQ)/F.T) ) **(K-1) ;  X  =  2,...,DIM, 

2 

while  the  cost  of  series  generation  is  usually  proportional  to  DIM  ,  The 
presence  of  the  stepsize  in  the  record  also  makes  it  possible  to  check  that  an 
operation  is  not  being  performed  on  two  series  with  different  stepsizes. 

One  of  the  important  problems  to  which  interval  analysis  has  been  applied 
since  its  beginnings  is  the  problem  of  controlling  the  truncation  error  of 
Taylor  series  methods  [13].  Hence  it  is  natural  to  support  Taylor  series  whose 
normalized  coefficients  are  intervals.  The  appropriate  declarations  in  Pascal- 
SC  are 

CONST  DIM  »  n;  {  User  supplies  n  } 

TYPE  DIMTYPE  -  1..DIM; 

INTERVAL  =  RECORD  INF,  SUP  :  REAL  END; 

(3.7)  IVECTOR  =  ARRAY [DIMTYPE]  OF  INTERVAL; 

ITAYLOR  =  RECORD  LENGTH  :  DIMTYPE; 

T  :  REAL; 

TC  :  IVECTOR  END; 

The  types  ITAYLOR  and  TAYLOR  are  the  same,  except  that  the  normalized 
coefficients  of  the  former  are  intervals.  The  same  recurrence  relations  are 
used  to  generate  series  of  each  type. 

The  stepsize  T  remains  real.  This  corresponds  to  bounding  the  range  of 
values  of  a  function  f  at  one  real  number  x.  There  are  some  applications  for 
which  it  is  necessary  to  bound  the  range  of  f  on  an  interval,  as  in  (1.13).  In 
this  case,  one  can  take  T  =  1  and  form  the  normalized  coefficients  by  computing 
the  needed  powers  of  (X  -  x)  by  interval  arithmetic,  or  else  introduce  a  new 
data  type  in  which  T  is  of  type  INTERVAL,  and  a  set  of  operators  corresponding 
to  those  given  here. 

The  declarations  (3.6)  and  (3.7)  of  types  TAYLOR  and  ITAYLOR,  respectively, 
are  basic  to  the  discussion  of  operators  in  the  next  section. 

4.  IMPLEMENTATION  OF  OPERATORS  AND  FUNCTIONS  FOR  TYPES  TAYLOR  AND  ITAYLOR.  As 
indicated  above,  the  ability  of  a  compiler  to  perform  formula  translation  can 
also  be  used  to  produce  machine  code  for  the  evaluation  of  the  normalized  Taylor 
coefficients  [1],  [3],  [11],  [13],  [14],  [16],  [17],  [19].  If  the  value  of 
function  f  is  obtained  by  a  composition 

(4.1)  £  -  f,  o  f2  o  ...  o  f. 
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of  a  finite  number  of  elementary  functions,  then  derivatives  of  f  can  be 
computed  by  the  chain  rule  from  the  derivatives  of  This  is  a  tedious 

and  error-prone  calculation  to  do  by  hand,  but  the  computer  does  it  not  only 
rapidly,  but  also  accurately. 

Recurrence  relations  for  calculating  the  normalized  Taylor  coefficients  for 
the  basic  arithmetic  operations  and  for  the  elementary  functions  are  well  known 
(see  117],  for  example).  Hence  machine  code  can  be  generated  to  expand  the 
Taylor  series  for  f  at  any  point  x  =  Xg  at  which  f  is  analytic.  These 
recurrence  relations  are  both  more  efficient  and  more  accurate  than  numerical 
differentiation  [18],  Recursive  generation  of  the  series  may  be  mildly  unstable 
[6] ,  but  the  interval-valued  Taylor  series  introduced  in  Section  3  can  give 
guaranteed  bounds  for  the  effect  of  any  such  instability.  In  Section  6,  we  show 
that  any  instability  in  the  series  generation  has  no  significant  effect  on  the 
series  sum. 

Rail  [19]  outlines  an  approach  to  abstract  data  types  for  real  and 
interval-valued  Taylor  series.  Our  implementation  generally  follows  that 
outline.  This  paper  discusses  extensions  and  some  of  the  inclement at ion 
details.  The  operators  and  functions  implemented  are  listed  in  Appendix  C. 
Source  code  in  Pascal-SC  is  given  in  the  report  [7] .  First  of  all,  in  order  for 
expressions  to  be  evaluated  correctly  when  they  include  variables  of  type  TAYLOR 
or  ITAYLOR,  the  arithmetic  operations  and  the  standard  functions  must  be  defined 
in  a  manner  which  incorporates  the  appropriate  recurrence  relations  for  the 
generation  of  the  normalized  Taylor  coefficients.  Our  implementation  in  Pascal- 
SC  attempts  to  follow  the  principles  of  uniformity,  compactness,  locality,  and 
linearity  for  a  good  programming  language  design  [23] .  Next  we  attempt  to 
justify  significant  departures  from  two  of  these  principles. 

The  principle  of  uniformity  in  programming  language  design  says  that  the 
same  things  should  be  done  the  same  way  whenever  they  occur.  Thus  "A  +  Bn  means 
"add",  regardless  of  the  types  of  the  variables  A  and  B.  The  other  arithmetic 
operators  enjoy  the  same  uniformity,  but  the  standard  functions  do  not.  For 
example,  exp(x)  is  EXP(X)  if  X  is  REAL,  IEXP(X)  if  X  is  INTERVAL,  TEXP(X)  if  X 
is  TAYLOR,  and  ITEXP(X)  if  x  is  ITAYLOR.  EXP  and  IEXP  are  built-in  functions 
which  were  designed  to  suggest  the  type  of  their  operand  and  result  as  an  aid  to 
reading  the  code.  That  is  especially  useful  since  Pascal  tends  to  violate  the 
principle  of  locality  by  placing  the  declaration  of  a  variable  far  from  its  use. 
We  chose  to  maintain  uniformity  of  our  extensions  with  the  built-in  functions. 

It  is  important  to  be  able  to  determine  the  type  of  a  variable,  and  it  would  be 
quite  non-uniform  if  IEXP  were  the  only  function  in  this  family  which  requires  a 
prefix. 

The  principle  of  locality  suggests  that  all  relevant  parts  of  the  program 
are  found  in  the  same  place.  We  attempt  to  follow  this  principle  in  each  of  our 
program  units,  but  the  use  of  the  global  constant  DIM  and  the  global  types 
RVECTOR,  IVECTOR,  TAYLOR,  and  ITAYLOR  is  a  violation.  The  use  of  such  global 
types  needed  in  the  headings  of  the  operators  and  functions  is  very  difficult  to 
avoid.  Their  use  has  the  advantage  that  all  of  the  information  about  the  length 
of  the  series  to  be  used  is  located  in  only  one  place,  CONST  DIM  =  n,  so  it  is 
easy  to  change. 

In  roughly  their  order  of  inportance,  the  goals  of  this  implementation 

are: 


183 


o  Consistent  set  of  software  tools, 

o  Correct  answer  whenever  possible. 

o  Useful  error  messages  when  no  correct  answer  is  possible, 
o  Readable  code  for  future  adaptations, 
o  Efficient  execution, 

o  Compact  code. 

For  example,  this  implies  that  although  efficient,  contact  code  is  sought, 
efficiency  and  compactness  are  sometimes  sacrificed  for  higher  goals.  In 
particular,  it  is  important  that  other  programmers  be  able  to  read  the  code, 
perhaps  in  order  to  improve  its  efficiency. 

Binary  operations  with  one  operand  of  type  TAYLOR  may  appear  with  the  other 
operand  of  type  INTEGER,  REAL,  or  TAYLOR?  and  the  two  operands  may  appear  in 
either  order.  Similarly,  binary  operations  with  one  operand  of  type  ITAYLOR  can 
have  a  second  operand  of  type  INTEGER,  INTERVAL,  or  ITAYLOR.  The  operators 
built  into  Pascal-SC  do  not  support  the  mixing  of  REAL  and  INTERVAL  operands 
because  real  numbers  are  viewed  as  being  potentially  inexact  [24] .  Our 
extensions  of  the  arithmetic  operators  to  interval  valued  Taylor  series  maintain 
uniformity  with  this  convention.  This  is  recognized,  but  not  explicitly  stated 
in  [19] .  If  a  programmer  Is  certain  that  a  real  number  X  is  exact  so  that  it 
may  safely  be  mixed  with  an  interval,  INTPT  (X)  converts  X  into  the  interval  [X, 
X]  . 


The  library  of  subroutines  to  support  confutations  with  types  TAYLOR  and 
ITAYLOR  includes  operators  (+,-,*,/,**),  special  power  functions  (sqr,  sqrt, 
ejq>),  standard  functions  (sin,  cos.  In,  arctan),  and  additional  functions  (tan 
and  the  Runge  function  f(x)  -  1/(1  +  x2),  to  which  the  user  can  add  more 
functions  and  procedures  as  desired.  The  analytic  operations  of  term-by-term 
differentiation  of  real  and  interval  series,  as  well  as  term-by-term 
differentiation  of  interval  series  are  also  provided  by  means  of  functions  for 
the  given  purpose.  There  is  also  a  set  of  utility  functions  and  procedures  to 
perform  frequently  needed  tasks,  such  as  reading  and  writing  real  and  interval 
series,  taking  the  midpoints  of  the  coefficients  of  an  interval  series  to  obtain 
a  real  series,  and  so  on. 

The  following  abbreviations  are  used  in  the  code  to  make  it  as  easy  as 
possible  to  locate  a  desired  operation  with  any  text  editor: 

K  INTEGER 

R  REAL 

I  INTERVAL 

T  TAYLOR 

IT  ITAYLOR. 

Using  these  abbreviations  to  distinguish  between  instances  of  overloading,  the 
operators  which  are  needed  to  support  variables  of  type  TAYLOR  and  ITAYLOR 
are: 


Addition  (Section  4.1): 

+  T,  K  +  T,  T  +  K,  R  +  T,  T  +  R,  T  +  T 

+  IT,  K  +  IT,  IT  +  K,  I  +  IT,  IT  +1,  IT  +  IT 
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Subtraction  (Section  4.1): 


-  T,  K  -  T, 

T 

- 

R  - 

T, 

T  - 

R, 

T 

- 

T 

-  IT,  K  -  IT, 

IT 

-  K, 

I  - 

IT, 

IT  - 

1/ 

IT 

- 

IT 

Multiplication  (Section  4 

.2): 

K  *  T, 

T 

*  K, 

R  * 

T, 

T  * 

R, 

T 

* 

T 

K  *  IT, 

IT 

*  K, 

I  * 

IT, 

IT  * 

I, 

IT 

* 

IT 

Division  (Section  4,2) 

* 

K  /  T, 

T 

/ 

R  / 

T, 

T  / 

R, 

T 

/ 

T 

K  /  IT, 

IT 

/  K, 

I  / 

IT, 

IT  / 

I, 

IT 

/ 

IT 

Power  ( Section  4.3): 

K  **  K,  R  ** 

K, 

K  ** 

R, 

R 

**  R 

I  ** 

K, 

K  ** 

I, 

I 

**  I 

K  **  T,  T  ** 

K, 

R  ** 

T, 

T 

*♦  R, 

T 

** 

T 

K  **  IT,  IT  ** 

K, 

I  ** 

XT, 

IT 

**  I, 

IT 

** 

IT 

Implementation  details  of  each  operator  are  discussed  in  the  Sections  shown. 
Pascal-SC  provides  no  power  operator,  so  **  must  be  defined  for  the  scalar  types 
before  it  can  be  extended  to  types  TAYLOR  and  ITAYLOR.  The  discussion  of  **  is 
postponed  to  follow  the  introduction  in  Section  4.3.1  of  special  cases  of 
exponentiation:  sqr,  sqrt,  and  exp. 

The  priorities  of  the  operators  given  in  this  Section  are: 

Highest:  Unary  addition  and  subtraction,  functions? 

Multiplication,  division,  and  powers:  *,  /,  ** 

Lowest:  Binary  addition  and  subtraction:  +, 

In  particular,  note  that  the  priority  of  **  relative  to  *  and  /  is  different 
than  in  FORTRAN. 

For  types  TAYLOR  and  ITAYLOR,  inp lamentation  has  been  provided  for  the 
Standard  functions  which  are  supported  in  Pascal-SC  for  types  INTEGER,  REAL,  and 
INTERVAL.  They  are: 

Special  powers  (Section  4.3,1): 

TSQR  (  T),  TSQRT  (  T),  TEXP  (  T) 

ITSQR  (IT),  ITSQRT  (IT),  ITEXP  (IT) 

Standard  functions  (Section  4.4): 

TSIN  (  T),  TCOS  (  T),  TLN  (  T),  TARCTAN  (  T) 

ITSIN  (IT),  ITCOS  (IT),  I TLN  (IT),  ITARCTAN  (IT) 

Additional  functions  (Section  4.5): 

TRUNGE  (  T),  TTAN {  T) 

ITRUNGE  (IT),  ITTAN(IT) 

Differentiation  and  integration  (Section  4.6): 

TDIFF (  T),  TINTGRL (  T) 

ITDIFF(IT),  ITINTGRL(IT) 
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Miscellaneous  utilities  (Section  4,7): 

VRNULL,  T_IDENT_ZERO(  T),  T_IDENT_CONSTANT (  T),  ITMIDPT(IT), 

IVRNULL,  IT_IDENT__ZERO(IT) ,  IT_IDENT_CONSTANT ( IT ) ,  WRITE_SERIES (  T), 
READ_INTERVAL  SERIES  (IT),  WRITE_INTEFVAL (  I),  WRITE_INTERVAL_SERIES  ( IT ) 

A  brief  description  of  the  method  for  introduction  of  user-defined 
functions  will  be  given  in  Section  4.5,  Some  implementation  details  of  the 
operators  and  functions  will  now  be  discussed.  The  recurrence  relations  are 
taken  from  [17],  In  following  the  conventions  of  Pascal-SC,  minor  differences 
from  the  indices  found  there  are  due  to  our  starting  the  series  indices  at  1 
instead  of  starting  at  0.  In  each  Section,  operations  involving  the  scalar 
types  are  discussed  before  turning  to  types  TAYLOR  and  ITAYLOR.  The  Pascal-SC 
source  code  for  the  operators,  functions,  and  utilities  listed  in  Appendix  C  are 
given  in  [7] , 


4.1,  Addition  and  subtraction. 
operators  are  quite  straightforward. 


The  ten  addition  and  ten  subtraction 


Addition: 

+  Tf  K  +  T,  T  +  K, 
+  IT,  K  +  IT,  IT  +  K, 


R  +  T,  T  +  R, 
I  +  IT,  IT  +  I, 


T  +  T 
IT  +  IT 


Subtraction : 

-  T,  K 

-  IT,  K 


T,  T 
IT,  IT 


K,  R  -  T,  T  -  R,  T 
K,  I  -  IT,  IT  -  I,  IT 


T 

IT 


Addition  and  subtraction  of  a  constant  alters  only  the  value  of  a  variable,  not 
the  values  of  any  of  its  derivatives.  Interval  constants  only  require  that  the 
appropriate  built-in  interval  operator  be  used.  Otherwise,  addition  or 
subtraction  of  series  is  done  term-by-term. 

If  U  :=  F  +  G,  then 


(4.1.1)  U.TC [K]  :=  F.TC [K]  _+  G.TC[K] ,  K  =  1,...,DIM. 

4.2.  Multiplication  and  Division. 

Multiplication: 

K  *  T ,  T  *  K,  R  *  T,  T*R,  T  *  T 

K  *  IT,  IT  *  K,  I  *  IT,  IT  *  I,  IT  *  IT 

Multiplication  and  division  of  two  Taylor  series  is  done  by  the  well- 
known  Leibniz  rule  for  the  Taylor  coefficients  of  a  product  [17] . 

If  U  F  *  G,  then 

K 

(4.2.1)  U.TC[K]  =  £  F.TC [I] *G.TC [K-I+ 1] ,  K  =  1,...,DIM. 

1-1 

The  scalar  product  of  two  vectors  is  evaluated  in  Pascal-SC  by  the  standard 
function  SCALP  to  the  closest  floating  point  number.  Fast  series  multiplication 
techniques  were  not  used  here  because 


o  In  many  applications  of  *,  the  series  for  U  is  being  generated 

recursively.  That  is,  the  variables  F  or  G  involve  U  itself, 
o  The  accuracy  of  SCALP  would  not  be  available. 

o  The  speed  of  SCALP,  especially  when  some  terms  are  zero,  makes 

these  techniques  less  attractive. 
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Multiplication  or  division  of  a  series  by  a  constant  is  done  term-by- 
term.  Division  of  a  constant  by  a  series  is  done  by  generation  of  the  series 
for  C/F ( x ) . 


Division: 

K/T,  T  /  K,  R  /  T,  T/R,  T/T 
K  /  ITf  XT  /  K,  I  /  IT,  IT  /I,  IT  /  IT 

If  U  F  /  G,  then  U  *  G  -  F,  and  Leibniz'  rule  applies: 

U.TCtl]  =  F .  TC  [  1  ]  /  G.TC[1], 


(4.2,2) 


for  K  —  2,  ... ,DIM, 


K-1 

U.TC[K]  -(  l  U.TC[I]  *G.TC[K-I+1]  )  /G.TC[1], 
1=1 


If  G (Xg )  =  G • TC [ 1 ]  =  0,  then  we  attempt  return  the  correct  answer  whenever 
possible.  If  f(xq)  =  F ,TC  [  1 ]  is  also  0,  then  we  can  apply  X* Hospital's  rule 
because  the  series  for  both  F  and  G  are  known.  U.TC[1]  =  F' (xQ )/G' (xQ )  * 
F.TC[2]/G.TC[2],  if  this  quotient  exists,  but  U  f  F'/G'  as  functions. 

If  U  :=  F/G,  and  F(xQ)  *  G(x0)  »  0,  then  let 

(4.2.3)  V.TC [K]  : —  F . TC [K+ 1 ] ?  W.TC  [K]  :=  G.TC [K+ 1 ] ;  K=  1,...,DIM-1. 

Then, 

(4.2.4)  U  :=  V  /  W. 

Thus,  1' Hospital's  rule  is  implemented  as  a  recursive  call  to  the  division 
operator  with  operands  whose  series  length  has  been  reduced  by  one.  This 
approach  would  not  be  possible  in  a  language  which  does  not  support  recursion. 
Further,  cases  in  which  the  series  for  both  f  and  g  have  several  leading  zeros 
are  handled  automatically  by  the  language. 

L* Hospital's  rule  is  applied  in  a  similar  manner  when  a  constant  quotient 
or  divisor  is  equal  to  zero. 

4.3.  Power  Operators.  The  power  operator  **  defined  by  F  **  G  =  F  is 
not  standard  in  Pascal  or  Pascal-SC,  but  can  be  implemented  in  the  latter  for 
data  types  for  which  it  is  meaningful  by  the  use  of  the  operator  concept. 

Coding  of  **  is  simplified  by  the  introduction  of  a  set  of  basic  power 
functions.  These  are  implemented  separately 

o  for  uniformity  with  Pascal-SC  which  provides  these  functions 
for  standard  data  types, 

o  to  provide  tighter  bounds  for  interval  operands,  and 
o  for  efficiency. 

4.3.1.  Special  Power  Functions  This  set  of  functions  consists 
of  the  square,  square  root,  and  natural  exponential  function  of  variables  of 
types  TAYLOR  and  ITAYLOR: 

TSQR  (  T),  TSQRT  (  T),  TEXP  (  T) 

ITSQR  (IT),  ITSQRT  (IT),  ITEXP  (IT) 
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These  functions  are  called  by  the  operator  **  when  appropriate.  For  example,  if 
X  is  of  type  ITAYLOR,  then  both  X  **  2  and  X  **  INTPT(2.0)  are  actually 
performed  by  a  call  to  ITSQR  (X).  The  use  of  this  function  rather  than  X  *  X  is 
important  in  interval  computations,  since,  for  example,  [-1,+1]  =  [0,1]  while 

[-1,+1]  *  [-1,+1]  =  Further,  the  squaring  functions  TSQR  and  ITSQR  are 

twice  as  fast  as  the  multiplication  Y  *  Y  for  variables  of  the  corresponding 
types. 

The  recurrence  relations  to  generate  the  series  terms  for  these  functions 
can  be  derived  easily  using  Leibniz'  rule.  The  squares  of  real  and  interval 
Taylor  series  are  computed  as  follows. 

If  U  :=  SQR(F),  then  Leibniz'  rule  for  a  product  can  be  shortened  to: 

For  K  =  1,  ...,  DIM, 

K  DIV  2 

(4.3.1)  U.TC[K]  =  Z  F.TC[I] *F.TC[K-I+1] ; 

I  =  1 

if  K  is  odd,  then  U.TC[K]  =  U,TC[K]  +  SQR  (F .TC [ (K+1 )/2] ) . 

The  inner  product  contains  only  TRUNC(K/2)  terms.  If  F  Ls  of  type  ITAYLOR  and 
includes  negative  numbers,  then  ITSQR  (F)  provides  tighter  bounds  than  does  F  * 
F.  The  SQR  functions  are  named  TSQR  and  ITSQR  to  indicate  the  type  of  operand 
accepted  and  value  returned. 

A  similar  function  was  written  for  CUBE.  Its  summations  had  length 
TRUNC(K/3)  but  they  were  nested  to  yield  a  cost  proportional  to  DIM3,  CUBE  is 
not  included  in  the  library  because  F  *  SQR  (F)  is  faster. 

The  functions  in  the  next  set  calculate  square  roots  of  real  and  interval 
Taylor  variables - 

If  U  :=  SQRT  (F),  then  U  *  U  -  F.  The  algorithm  runs  as  follows: 

U . TC [ 1 ]  :=  SQRT  (F.TC[1] ); 

U.TC[2]  :=  F .TC [2]  /  (2  *  U.TC[1])j 
for  K  —  3,  . . . ,  DIM, 

(4.3.2)  K  DIV  2 

PROD  :=  X  U.TC [I] *U .TC [K-I+1 ] ; 

I  ~  2 

if  K  is  odd,  then  PROD  PROD  +  SQR  (U.TC [ (K+1 )/2] ) ; 

U . TC [K]  (F .TC [K]  -  PROD)  /  (2  *  U.TC[1]). 

If  F(xq)  =  0,  and  F  is  not  a  constant  series,  then  SQRT  (F)  cannot  be  computed 
unless  F  is  the  constant  0,  because  F'(xQ)  is  not  defined.  The  SQRT  functions 
are  named  TSQRT  and  ITSQRT  to  indicate  the  type  of  operand  accepted  and  value 
returned. 

The  natural  exponential  functions  (base  =  e)  are  now  defined  for  types 
TAYLOR  and  ITAYLOR, 

If  U  :=  EXP  (F),  then  U'  =  U  *  F'.  This  gives  the  algorithm: 
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U.TC  C 1 1  ;  =  EXP  (F*TC [ 1 ]  ); 


for  K  =  2/ . . . ,DIM, 


(4.3.3) 


K-1 

U.TC[K]  (  l  U.TC[I]*F.TC[K-I+1]*(K-I)  )  /  (K-1); 
1=1 


Although  this  formula  would  appear  slieflitly  simpler  with  the  change  of  index  J  = 
K  -  If  it  was  implemented  in  this  way  so  that  the  U.TC  terms  remain  stationary 
in  the  inner  product  as  K  increases.  Thus,  only  the  vector  F.TC  needs  to  be 
"reversed".  The  EXP  functions  are  named  TEXP  and  ITEXP  to  indicate  the  type  of 
operand  accepted  and  value  returned. 

4.3.2.  The  Operator  **.  The  family  of  power  operators  **  seems 
to  be  the  most  difficult  to  implement  as  suggested  by  (5] .  None  of  the 
operators  are  especially  difficult,  but  there  are  many  minor  details  to  be 
considered.  The  implementation  of  **  for  types  TAYLOR  and  ITAYLOR  is  based  on 
the  standard  power  functions  above,  and  the  power  operators  **  for  the  scalar 
types  INTEGER  (K) ,  REAL  (R),  and  INTERVAL  (I). 

Scalar  Powers; 

K  **  K,  R  **  K,  K  **  R,  R  **  R 

I  **  K,  K  **  I,  1**1 

Integer  powers  are  in^lemented  using  repeated  squaring.  Real  and  interval 
powers  which  fit  no  special  case  are  computed  by  F  **  G  =  EXP  ( G  *  LN ( F ) ) .  We 
have  not  attenpted  optimal  implementations  of  the  scalar  power  operators  because 
it  is  hoped  that  they  will  be  provided  as  standard  operators  in  a  later  release 
of  Pascal-SC,  an  approach  that  is  especially  attractive  for  interval  operands 
because  the  interpreter  hides  information  from  programmers  which  can  be  used  for 
correctly  directed  roundings. 

Real  and  Interval  Taylor  Powers; 

K  **  T,  T  **  K,  R  **  T,  T  **  R,  T  **  T 

K  **  IT,  IT  **  K,  I  **  IT,  IT  **  I,  IT  **  IT 

The  power  operators  for  a  constant  to  a  variable  power  follow  the  pattern 
of  TEXP  or  ITEXP,  as  appropriate.  A  series  which  represents  a  constant  (only 
its  first  term  is  non-zero)  is  handled  as  a  special  case  for  accuracy 
(especially  for  interval  series)  and  for  efficiency. 

The  operators  T  **  K  and  IT  **  K  take  care  to  return  the  correct  answer 
whenever  that  is  possible  and  to  produce  an  appropriate  error  message  when  it  is 
not  possible.  The  resolution  of  various  cases  is  shown  in  Table  4.1. 

Consider  a  series  whose  first  term  is  zero,  but  which  has  other  terms  which 
are  non-zero.  Raising  such  a  series  to  a  negative  power  is  undefined  because  it 
is  equivalent  to  dividing  by  zero,  but  raising  such  a  series  to  the  power  0 
defines  a  function  which  is  identically  equal  to  1,  except  for  a  removable 
singularity  at  x  =  Xq.  Hence  it  is  appropriate  to  give  1  as  the  answer. 

Raising  the  series  to  a  positive  integer  power  is  implemented  by  repeated 
squaring  because  the  recurrence  relation  which  is  most  often  used  [21]  requires 
division  by  BASE.TC[1],  which  is  zero. 
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Exponent: 


0 


1 


2 


>2  <  0 


Base.TC 

=  0 

+ - - - 

1 

I  Undef. 

— +- 

=  0 

=  0 

=  0 

Undef. 

1 

l 

1 

Base.TC [1] 

=  0 

1 

|  1 

1 

1 

1 

( 

=  BASE 

1 

1 

SQR 

(BASE) 

By  mult. 

Undef. 

1 

1 

I 

1 

1 

I 

1 

1 

1 

Base.TC [ 1 ] 

A 

V 

O 

1 

|  1 

1 

1 

1 

| 

=  BASE 

I 

1 

1 

SQR 

( BASE ) 

By  recurrence 

1 

+ - — — 

! 

— +. 

1 

-+- 

_ 

Table  4.1,  Resolution  of  Cases  for  **. 


The  special  cases  of  an  exponent  equal  to  1  or  2  are  singled  out  for 
individual  treatment  in  order  to  achieve  the  maximum  possible  accuracy 
(especially  when  the  base  is  an  interval  series)  and  for  efficiency. 

Except  in  the  special  cases  shown  in  the  table,  if  U  -  F  **  E,  where  E  is 
of  type  INTEGER,  then  F  *  U'  -  E  *  U  *  F*.  This  gives  the  algorithm 

U.TC  :=  0; 

U • TC  [  1 ]  :=  F . TC [ 1 ]  **  E; 

(4.3.4)  For  K  2  to  DIM, 

K-  1 

U.TC [K]  :=  (  £  (E* (K-I )  -  I  +  1)  *  U.TC[I]  *  F.TC[K-I+1])  /  ( (K- 1 ) *F.TC [I]  ) . 

1=1 


The  integer  exponent  appears  in  the  recurrence  only  as  a  multiplier.  Hence  the 
speed  of  this  algorithm  is  nearly  independent  of  the  size  of  the  exponent.  That 
is  why  this  algorithm  is  preferred  to  repeated  squaring. 

The  operators  T  **  R  and  IT  **  I  are  similar  to  T  **  K  and  IT  **  K, 
respectively,  except  that  the  additional  special  cases  of  an  exponent  equal  to 
1/2  or  to  an  integer  are  handled. 

The  operators  T  **  T  and  IT  **  IT  are  included  primarily  for  completeness? 
the  authors  have  never  seen  a  differential  equation  containing  a  variable  to  a 
variable  power,  for  example.  Perhaps  any  such  problems  which  arise  are  at  once 
simplified  by  logarithmic  differentiation.  With  the  tools  described  here,  it 
may  be  advantagious  to  attack  the  problem  in  its  original  form. 

Within  the  operators  T  **  T  and  IT  **  IT,  the  cases  in  which  either  the 
base  or  the  exponent  series  represent  a  constant  function  are  treated  as  special 
for  reasons  of  accuracy  and  efficiency.  Otherwise,  F  **  G  =  EXP  (G  *  LN  ( F ) ) , 
using  TEXP  and  TLN  or  ITEXP  and  ITLN,  as  appropriate. 

4.4.  Standard  Functions.  There  are  many  useful  library  functions  which 
could  be  provided.  We  have  chosen  to  implement  the  functions  which  are  built 
into  Pascal-SC  for  the  standard  scalar  data  types,  and  a  few  others.  Additional 
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functions  can  be  added  as  they  are  needed  by  following  the  models  provided  by 
this  paper,  in  addition  to  the  standard  power  functions  of  Section  4.3.1  (which 
include  EXP  and  IEXP),  other  standard  functions  in£>lemented  for  types  TAYLOR  and 
ITAYLOR  are: 


TSIN  (  T)  ,  TCOS  (  T),  TLN  (  T),  TARCTAN  (  T) 

ITSIN  (IT),  ITCOS  (IT),  I TLN  (IT),  ITARCTAN  (IT) 

If  U  -  sin  (F)  and  V  =  cos  (F),  then  U'  =  V  *  F*  and  V'  =  -  U  *  F*. 

U.TC[1]  :=  SIN  (F.TCtl])?  V.TC[1]  :=  COS  (F.TC[1])y 

for  X  2, • *  * , DIM, 

K 

(4.4.1)  U.TC[K]  :=  (  Z  V.TC [I]  *  F.TC[K-I+1]  *  (K-I)  )  /  (K- 1 ) ; 

1=2 


K 

V.TC[K]  :=  -  (  Z  U.TC[I]  *  F.TC[K-I+1]  *  (K-I)  )  /  (K-1). 
1=2 


The  SIN  and  COS  functions  are  named  TSIN  and  TCOS  or  ITSIN  and  ITCOS  to 
indicate  the  type  of  operand  which  they  accept  and  value  they  return.  Since  the 
series  for  SIN  and  COS  are  always  confuted  together,  the  library  also  contains 
procedures  T_SIN_COS  and  IT__SIN_COS  which  return  both  the  SIN  and  COS  of 
variables  of  type  TAYLOR  and  ITAYLOR,  respectively,  in  the  same  call. 

If  U  :=  In  (F),  then  U'  *  F  =  F*. 

U •  TC  [  1  ]  :=  LN  ( F •  TC  [  1  ]  )? 

(4.4.2)  for  K  :  =  2,...,DIM, 

K-1 

U.TC[K]  :  =  (F.TCtK]  -  (  Z  U.TC[I]  *  F.TC[K-I+1]  *  (1-1)  )  /  (K-1))  /F.TC[1]. 

1=2 

There  is  a  misprint  in  this  recurrence  relation  in  ([17],  p.  42),  but  its 
implementation  is  straightforward. 

If  U  :=  arctan  (F)  and  V  :=  1  /  (1  +  F2 ) ,  then  U*  =  V  *  F ' . 

V  :=  1  /  (  1  +  SQR  (F)  ) y 
U.TC  [  1 ]  !=  ARCTAN  (  F.TCfl]  )y 

(4.4.3)  for  K  :  =  2 , • . , ,DIM, 

K 

U.TC [K]  :=  (  Z  V.TC [I]  *  F.TC[K-I+1]  *  (K-I)  )  /  (K-1). 

1=2 

Since  the  series  for  the  Runge  function  V(F)  ([8],  p,  78)  is  required  to 
compute  the  series  for  arctan(F),  functions  TRUNGE  and  ITRUNGE  are  included  in 
the  library  along  with  the  functions  TARCTAN  and  ITARCTAN. 
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4.5.  User  Defined  Functions.  If  a  programmer  requires  an  operation  or 
a  function  which  is  not  included  in  this  report,  the  requirement  can  be  met 
either  by  a  composition  of  operators  and  functions  which  are  already  provided, 
or  by  a  careful  derivation  of  the  necessary  recurrence  relations  following  the 
models  in  this  report.  For  example,  the  tangent  functions  TTAN(  T)  and 
ITTAN(IT)  are  implemented  essentially  by 


(4.5.1) 


TTAN (  T)  :=  TSIN (  T)  /  TCOS (  T), 
ITTAN(IT)  ITSIN(IT)  /  ITCOS(IT), 


respectively,  in  the  set  of  additional  functions  provided  in  the  library.  The 
tangent  functions  can  also  be  inp lamented  directly  by  recurrence  relations, 
using  the  fact  that  y  =  tan(x)  satisfies  the  differential  equation 

(4.5.2)  y*  =  1  +  y2,  y(x0)  ~tan(xQ), 

[13],  [14],  Thus,  for  U  :=  TTAN (  T),  for  example, 

(4.5.3)  U * TC [ 1 ]  :=  TAN (T.TC [ 1 ]  ) ,  U.TC [2]  (1  +  SQR(T.TC [ 1 ]  ) )  *  T.T. 

The  succeeding  coefficients  can  be  obtained  in  a  simple  way  from  the  recurrence 
relation  (4.3.1)  for  TSQR(  T),  and  I TTAN ( IT )  is  confuted  analogously. 

2 

The  Runge  function  f(x)  -  1  /  (1  +  x  ),  which  is  an  auxiliary  function  for 
the  series  expansion  of  the  arctangent,  is  implemented  in  the  library  by 


U  :=a#  TSQR( IT )  y 

(4,5.4)  U •  TC  [  1  ]  1  +  U.TC  [  1 ]  ? 

TRUNGE  :=  1  /  U. 


ITRUNGE  is  computed  similarly. 


4.6.  Differentiation  and  Integration.  Functions  which  return  the 
results  of  term-by-term  differentiation  and  integration  of  TAYLOR  and  ITAYLOR 
series  are  also  provided.  For  series  with  1  <  LENGTH  <_  DIM,  differentiation 
decreases  the  length  of  the  series  by  one: 


(4.6.1)  U.TC [K]  :  =  T.TC[K  +  1]  *  RATIO  /  (K  +  1 ) ,  K 


1, . . . ,T. LENGTH  -  1, 


where  RATIO  =  1  /  T.T  if  U  =  TDIFF  (T),  and  RATIO  =  1  /  INTPT  (T.T)  if  U  = 

ITDIFF  (T).  Integration  results. in  a  series  with  its  first  coefficient  set  to  0 
and  its  length  increased  by  one: 


(4.6.2)  U.TC [K]  :=  T.TC[K  -  1]  *  RATIO  /  (K  -  1),  K  =  2, ... ,T .LENGTH, 


with  U.TC [ 1 ]  =  0  and  RATIO  =  T.T  for  U  -  TINTGRL(T),  while  U.TC[1]  =  INTPT(O) 
and  RATIO  -  INTPT (T.T)  for  U  =  ITINTGRL(T).  The  result  of  integration  of  a 
series  of  length  DIM  will  be  truncated  to  length  DIM. 


4.7.  Miscellaneous  Utilities.  Some  useful  functions  and  procedures  are 
provided  for  convenience.  These  are  the  transfer  function  TMIDPT  (IT),  the 
special  functions  VRNULL,  IVRNULL,  the  comparison  functions  T__IDENT_ZERO, 
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T_I DENTjCONSTANT ,  IT_IDENT_ZERO ,  I  T_I  DENTJCONSTANT,  and  the  input/output 
procedures  WRITE_INTERVAL  (I),  WRITE^SERIES  (T),  READ_INTERVAL_SERIES  (IT), 
WRITE_INTERVAL_SERIES  (IT),  The  purposes  of  most  of  these  utilities  are 
indicated  by  their  names. 

The  transfer  function  TMIDPT  (IT)  forms  a  TAYLOR  series  from  a  series  of 
type  ITAYLOR.  The  coefficients  of  the  result  series  are  the  midpoints  of  the 
corresponding  coefficients  of  the  interval  series. 

The  parameterless  functions  VRNULL,  IVRNULL  yield  zero  real  and  interval 
vectors,  respectively,  of  length  DIM.  They  are  standard  in  Pascal-SC. 

The  compar ison  functions  yield  the  BOOLEAN  value  TRUE  if  their  argument 
satisfies  the  stated  condition  (the  series  is  identically  equal  to  zero  or  a 
constant),  otherwise,  FALSE. 

The  input/output  procedures  are  also  self-explanatory.  The  procedure  WRITE 
INTERVAL  is  included,  since  the  standard  Pascal-SC  procedure  IWRITE  only  prints 
the  digits  of  the  lower  and  upper  endpoints  of  intervals  which  agree  up  to  the 
last  [-24]  .  WRITE_INTERVAL,  however,  prints  all  digits  of  each  endpoint. 

5.  THE  INITIAL-VALUE  PROBLEM  FOR  ORDINARY  DIFFERENTIAL  EQUATIONS.  Taylor 
series  methods  for  the  numerical  solution  of  initial-value  problems  for  systems 
of  ordinary  differential  equations  have  been  studied  by  many  authors  (see  [6]  or 
[14]  for  summaries),  and  have  been  used  for  applications  such  as  dynamics  and 
parameter  identification.  Each  component  of  the  solution  of 

(5,1)  yi’  =  f±(x,y),  y±(x$)  =  yi0,  i=  1,...,m, 

is  expressed  as  a  Taylor  series  expanded  at  x  -  xQ  using  recurrence  relations 
derived  from  the  differential  equation.  Various  error  control  strategies  have 
been  employed.  The  strategy  of  analyzing  the  radius  of  convergence  of  each 
component  series  has  the  desirable  side  effect  of  producing  such  analytic 
information  as  the  location  and  orders  of  the  singularities  in  the  solution. 

Once  the  radius  of  convergence  is  known,  a  steps ize  can  be  chosen  which  is  as 
large  as  possible  subject  to  error  control  and  stability  constraints.  Then  each 
component  of  the  solution  is  extended  by  analytic  continuation  and  the  process 
is  repeated  at  the  next  integration  step.  This  algorithm  is  discussed  in 
greater  detail  in  [6] . 

A  program  RDEQ_SOLV  for  solving  equation  (5.1)  is  given  as  Appendix  A  of 
this  report.  The  program  is  written  for  the  case  m  =  1,  but  can  be  modified 
easily  to  handle  systems  of  several  equations.  The  variables  Y  and  YPRIME  are 
declared  to  be  of  type  TAYLOR,  and  the  equation  is  written  in  a  natural  way.  To 
solve  a  different  equation,  it  is  only  necessary 

o  to  change  the  line  in  RDEQ_SOLV  which  contains  the  differential 
equation; 

o  to  copy  from  the  library  into  the  source  program  any  operators 
or  functions  required  by  the  new  differential  equation. 

Because  the  differential  equation  is  written  using  the  types  and  operators 
discussed  in  the  preceding  Sections,  the  needed  recurrence  relations  are 
implemented  by  the  Pascal-SC  compiler  and  need  not  be  derived  explicitly  by  the 
user. 
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The  program  prints  the  series  terms,  extends  the  solution  by  analytic 
continuation  to  compute  the  initial  condition  at  the  next  step,  and  repeats  the 
process.  The  program  RDEQ_SOLV  in  Appendix  A  is  intentionally  simple  to 
illustrate  the  use  of  the  Taylor  operators  and  to  explore  the  stability  of  the 
series  generation.  It  would  require  an  error  control  mechanism  in  order  to  be 
of'  practical  value  for  the  solution  of  initial  value  problems.  Either  scalar 
[6]  or  interval  [13]  error  control  techniques  can  be  used. 

The  program  IDEQ_SOLVE  listed  as  Appendix  B  of  this  report  computes 
interval -valued  approximate  solutions  to  equation  (5.1)  for  the  case  m  —  1,  but 
can  be  modified  for  systems  of  several  equations.  It  differs  from  the  program 
RDEQ_SOLV  only  in  that 

i)  the  variables  Y  and  YPRIME  are  of  type  ITAYLOR  instead  of  type 
TAYLOR,  and 

ii)  additional  code  has  been  added  to  monitor  the  relative  error 
introduced  by  instability  in  the  series  generation  process. 

These  two  programs  are  designed  to  serve  as  examples  of  one  way  in  which 
the  tools  of  this  report  can  be  used.  They  are  simple,  menu-driven  programs 
which  allow  direct  user  intervention  at  each  integration  step.  By  observation 
of  the  outcome  of  each  step,  the  user  can  experiment  with  error  control 
strategies. 

The  bounds  computed  by  IDEQ^SOLV  are  for  the  interval-valued  Taylor 
polynomial  (3.3),  They  are  not  global  error  bounds  for  the  solution  of  the 
differential  equation.  Global  error  bounds  are  readily  computable  using 
interval  remainder  terms  (see  [13] ),  but,  for  simplicity,  the  programs  given 
here  contain  no  error  bounding  or  control  strategy. 

6.  AN  APPLICATION:  STABILITY  OF  SERIES  GENERATION.  In  this  section,  we 
present  an  example  which  uses  the  Taylor  and  interval  Taylor  operators.  This 
example  was  chosen  because  it  illustrates  the  uses  of  these  operators  and 
because  it  addresses  the  issue  of  stability  in  the  generation  of  the  series.  The 
latter  issue  is  central  in  showing  that  Taylor  series  methods  are  reliable  for 
practical  computations . 

A  numerical  computation  is  said  to  be  unstable  if  its  relative  error  grows 
without  bound  as  the  computation  proceeds.  It  is  possible  that  the  recurrence 
relations  being  used  might  be  unstable,  although  instability  has  never  been 
observed  in  practice.  This  example  uses  the  Taylor  and  interval  Taylor 
operators  to  explore  the  stability  of  the  recurrence  relations  in  one 
application.  In  this  example,  there  is  instability  in  the  generation  of  the 
terms  of  the  series,  but  that  does  not  seriously  affect  the  accuracy  of  the 
series  summation.  The  stability  of  the  recurrence  relations  in  other 
applications  can  be  handled  similarly. 

Consider  the  initial  value  problem 
(6.1)  y'  “  y2,  y(0)  -  1, 

whose  solution  is  y(x)~1/(1-x).  A  program  (RDEQ_SOLV)  for  solving 
equation  (6.1)  using  the  Taylor  function  TSQR  is  given  as  Appendix  A  of  this 
report.  The  effect  of  program  FDEp__SOLV  is  to  generate  the  normalized  Taylor 
coefficients  (4.3.1)  of  the  solution  recursively.  This  recurrence  is 
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accomplished  automatically  by  the  Taylor  function  TSQR  in  the  statement 
YPRIME  :=  TSQR  (Y).  In  this  case,  the  same  solution  is  obtained  if  Y  **  2  or 
Y  *  Y  is  used  instead  of  TSQR  (Y);  however,  the  use  of  Y  **  2  requires  the 
compilation  of  much  more  code,  while  Y  *  Y  is  not  as  fast  as  TSQR  (Y). 

We  wish  to  explore  the  stability  of  the  recurrence  relation  (4.3.1),  This 
issue  is  separate  from  the  issue  of  the  stability  of  Taylor  series  methods  for 
solving  initial  value  problems.  If  an  infinite  Taylor  series  were  used,  the 
method  would  be  A-stable,  but  when  a  truncated  series  is  used,  the  region  of 
stability  is  bounded.  Stetter  [22]  showed  that  the  region  of  stability  for 
truncated  Taylor  series  methods  is  the  same  as  that  for  related  Runge-Kutta 
methods.  The  real  interval  of  stability  is  relatively  large  here  because  long 
series  are  used.  For  example,  the  real  intervals  of  stability  are  [-8.85,  0] 
and  [-16.29,  0],  respectively,  if  DIM  =  20  and  40  terms  of  the  series  are 
used. 


We  will  outline  the  theoretical  analysis  of  the  stability  of  recurrence 

(4.3.1).  A  more  complete  discussion  appears  in  [6],  Let  U(K)  denote  the  actual 
and  Y(lO  denote  the  computed  normalized  Taylor  coefficients.  Let  Y(1)  = 

U(1)  (1  +  e)  =  (1  +  e)  from  (6.1).  Then  U(K)  =  hK'\  and 

(6.2)  Y(K)  =  u(K)  (1  +  e)Kf 

so  the  series  generation  is  unstable.  However,  the  summation  of  the  series  is 
unaffected  by  this  instability  since 


(6.3) 


DIM  DIM 

y  (x1 )  =  I  Y  (K)  =  (1  +  e)  Z  (h  ( 1  +  e))K~ 
K=1  K— 1 


=  (1  +  e)  y (h( 1  +  e))  +  o(hDIM) 

=  (1  +  e)(y(h)  +  y'(S)h)  +  0(hDIM),  h  <  5  <  h ( 1  +  e). 

That  is,  the  instability  in  the  series  generation  is  equivalent  to  a  small  error 
in  the  point  at  which  the  series  is  evaluated.  This  is  because  ZRY (K)  is  a 
convergent  series,  so  the  terms  for  which  instability  causes  the  relative  error 
to  be  largest  are  themselves  very  small. 

This  suggests  using  interval  arithmetic  to  keep  track  of  the  potential 
growth  In  the  series.  The  program  IDEQ_SOLV  listed  as  Appendix  B  of  this  report 
does  so. 


By  declaring  Y  and  YPRIME  to  be  of  type  ITAYLOR,  the  statement  YPRIME  := 
ITSQR(Y)  invokes  the  function  ITSQR  for  interval  valued  series  to  generate 
interval  normalized  Taylor  coefficients  according  the  recurrence  relation 

(4.3.1) .  The  lengths  of  successive  coefficients  measure  the  stability  of  the 
recurrence.  Table  6-1  shows  the  interval  valued  series  solution  of  equation 

(6.1)  for  DIM  =  15,  y(xQ)  =  y(0)  =  [0.99  ,  1.01]  (e  =  +_  0.01),  and  h  =  0.5. 
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READ  INTERVAL  INITIAL  CONDITIONS  XO,  Y(X0): 


INITIAL  CONDITIONS  AT  XO  =  [  O.OOOOQE+OO,  0 . 0000 OE+OO ] , 

Y0  =  [  9.90000E-01 ,  1 . 0 1 OOOE+OO ] . 

ENTER  STEPSI2E  X  -  XO:  0.5 
Computing  series  terms  . . . 


.ep 

Left 

Right 

Computed 

Theoretical 

Endpoint 

Endpoint 

Instability 

Instability 

1 

[  9.90000E-01, 

1.01 OOOE+OO ] 

1.01 0E+00 

1.010E+00 

2 

[  4.90050E-Q1, 

5. 10050E-01] 

1 .020E+00 

1 .020E+00 

3 

[  2.42575E-01, 

2 . 57575E-0 1 ] 

1 » 030E+00 

1 . 030E+Q0 

4 

[  1.20075E-01, 

1 .30076E-01] 

1 .040E+00 

1.041E+00 

5 

[  5.94369E-02, 

6. 5688 IE-02 ] 

1.050E+00 

1 . 051E+00 

6 

[  2.94213E-02, 

3.3 1 725E-02] 

1 . 060E+00 

1.062E+00 

7 

[  1.45635E-02, 

1 . 6752  IE-02] 

1. 070E+00 

1. 072E+00 

8 

[  7 . 20894E-03 , 

8.45982E-03] 

1 .080E+00 

1 .083E+00 

9 

[  3. 56843E-03, 

4. 27221E-03] 

1 , 090E+00 

1. 094E+00 

10 

[  1.76637E-03, 

2.15747E-03] 

1 . 100E+00 

1. 105E+00 

1  1 

[  8. 74354E-04, 

1.08952E-03] 

1. 110E+00 

1.116E+00 

12 

[  4.32805E-04, 

5 . 50208E-04] 

1 . 1 19E+00 

1 . 127E+00 

13 

[  2. 1 4239E-04 , 

2.77855E-04] 

1. 129E+00 

1. 138E+00 

14 

[  1.06048E-04, 

1 .40317E-04] 

1. 139E+00 

1 . 149E+00 

15 

[  5.24938E-05, 

7. 08599E-Q5] 

1. 149E+00 

1 . 161E+0Q 

THE  VALUE  AT  X  =  [  5.00000E-01,  5.00000E-01] 

IS  Y  =  [  1.96034E+00,  2. 04033E+00 ] . 


Table  6.1.  Interval  bounds  for  instability. 


The  computed  instability  is  equal  to 


(6.4) 


E 


Computed 


length  (Y,TC[K]) 
midpoint  (Y.TC(K]) 


a  measure  of  the  relative  error  in  Y.TC(K)  which  appears  to  grow  as  K 
increases.  The  theoretical  instability  is  equal  to 


(6.5) 


E 


Theoretical 


(1  +  e) 


K. 


Table  6.1  shows  that  these  two  values  are  very  close,  and  that  the  theoretical 
bound  is  slightly  larger  than  the  actual  bound,  as  it  should  be*  The  interval 
estimate  for  y(0.5)  agrees  well  with  the  interval  [y(0.46),  y(0.54)]  =  [1.8518, 
2.1739] . 


7 .  IMPLEMENTATION  DETAILS .  The  software  described  in  this  report  was  created 
and  tested  using  the  Pascal-SC  compiler  developed  at  the  University  of  Karlsruhe 
for  the  Zilog  MCZ-1  microcomputer  with  the  RIO  2.06  operating  system.  No  other 
claims  of  correctness  or  usability  are  made. 


196 


REFERENCES 


1.  D.  Barton,  I*  M.  Willers,  and  R.  V.  M.  Zahar.  Taylor  series  methods  for 
ordinary  differential  equations  -  An  evaluation.  In  Mathematical 
Software,  John  Rice  (Ed.).  Academic  Press,  New  York,  1971,  369-390. 

i 

2.  G.  Bohlender,  K.  Gruner,  E.  Kaucher,  R.  Klatte,  W.  Kramer,  U.  W. 
Kulisch,  S.  M.  Rump,  Ch.  Ullrich,  J.  Wolff  von  Gudenberg,  and  W. 

L.  Miranker.  Pascal-SC:  A  Pascal  for  contemporary  scientific 
computation.  Research  Report  RC  9009,  IBM  Thomas  J.  Watson  Research 
Center,  Yorktown  Heights,  N.  Y. ,  1981. 

3.  Y.  F.  Chang.  Automatic  solution  of  differential  equations.  In 
Constructive  and  Confutations 1  Methods  for  Differential  and  Integral 
Equations,  D.  L.  Colton  and  R.  P.  Gilbert  (Eds.).  Lecture  Notes  in 
Mathematics,  Vol  430,  Springer-Verlag,  Berlin-Heidelberg-New  York, 

1974,  61-94. 

4.  Y.  F,  Chang,  M.  Tabor,  J.  Weiss,  and  G.  F,  Corliss.  On  the  structure 
of  the  Henon  Heiles  system.  Phys.  Lett.  A  85A  (1981),  211-213. 

5.  w.  J.  Cody  and  W.  Waite.  Software  Manual  for  the  Elementary  Functions. 
Prentice-Hall,  Englewood  Cliffs,  N.  J. ,  1980. 

6.  George  Corliss  and  Y.  F.  Chang,  Solving  ordinary  differential  equations 
using  Taylor  series.  ACM  Trans.  Math.  Soft.  8  (1982),  114-144. 

7.  George  Corliss  and  L.  B.  Rail.  Automatic  Generation  of  Taylor  Series  in 
Pascal-SC:  Basic  Operations  and  Applications  to  Ordinary  Differential 
Equations.  MRC  Technical  Summary  Report  No.  2497,  Mathematics  Research 
Center,  University  of  Wisconsin-Madison,  1983. 

8.  Philip  J.  Davis.  Interpolation  and  Approximation.  Blaisdell,  New  York, 
1963, 

9.  K,  Jensen  and  N.  Wirth.  Pascal  User  Manual  and  Report,  2nd  Ed. 
Springer-Verlag,  Berlin-Heidelberg-New  York,  1974. 

10.  Carlo  Ghezzi  and  Mehdi  Jazayeri.  Programming  Language  Concepts. 

Wiley,  New  York,  1982. 

11.  G.  Kedem.  Automatic  differentiation  of  computer  programs.  ACM  Trans. 
Math.  Soft.  6  (1980),  150-165. 

12.  U.  W.  Kulisch  and  W.  L  Miranker.  Computer  Arithmetic  in  Theory  and 
Practice.  Academic  Press,  New  York,  1981, 

13.  R.  E.  Moore.  Interval  Analysis.  Prentice-Hall,  Englewood  Cliffs, 

N.  J. ,  1966. 


14.  R.  E.  Moore.  Methods  and  Applications  of  Interval  Analysis.  SIAM 
Studies  in  Applied  Mathematics,  2,  Philadelphia,  1979. 


197 


15.  J.  F,  Palmer.  VSLI  and  the  revolution  in  numeric  computation.  Pro¬ 
ceedings  of  the  10th  IMACS  World  Congress  on  System  Simulation  and 
Scientific  Computation,  Vol.  1,  pp.  339-341.  Montreal,  1982. 

16.  L.  B.  Rail.  Applications  of  software  for  automatic  differentiation  in 
numerical  confutation.  Computing,  Suppl.  2  (1980),  141-156. 

17.  L.  B.  Rail.  Automatic  Differentiation:  Techniques  and  Applications. 
Lecture  Notes  in  Conputer  Science  No.  120,  Springer-Verlag, 

Berlin -Heidelberg-New  York,  1981. 

18.  L.  B.  Rail.  Differentiation  in  Pascal-SC:  'Type  GRADIENT.  Technical 
Summary  Report  No.  2400,  Mathematics  Research  Center,  University  of 
Wisconsin-Madison,  1982. 

19.  L.  B.  Rail.  Differentiation  and  generation  of  Taylor  coefficients 
in  Pascal-SC.  Technical  Summary  Report  No.  2452,  Mathematics 
Research  Center,  University  of  Wisconsin-Madison,  1982. 

20.  L.  B.  Rail.  Mean-value  and  Taylor  forms  in  interval  analysis. 

SIAM  J.  Math.  Anal.  14  (1983),  no.  2,  223-238. 

21.  Allen  Reiter.  Automatic  generation  of  Taylor  coefficients  (TAYLOR) 
for  the  CDC  1604.  Technical  Summary  Report  No.  830,  Mathematics 
Research  Center,  University  of  Wisconsin-Madison,  1967. 

22.  H.  J.  Stetter.  Analysis  of  Discretization  Methods  for  Ordinary 
Differential  Equations.  Springer-Verlag,  Berlin-Heidelberg-New  York, 
1973. 

* 

23.  G,  Weinberg.  The  Psychology  of  Computer  Programming.  Van  Nostrand 
Reinhold,  New  York,  1971, 

24.  J.  Wolff  von  Gudenberg.  Gesamte  Arithmetik  des  PASCAL-SC  Rechners: 
Benutzerhandbuch.  institute  for  Applied  Mathematics,  University 
of  Karlsruhe,  1981. 


198 


APPENDIX  A 


PROGRAM  RDEQ_SOLVE  (INPUT,  DATA,  OUTPUT); 

(*  SOLVE  A  FIRST  ORDER  DIFFERENTIAL  EQUATION:  Y*  =  SQR  (Y)  *) 


CONST  DIM  =  30? 

TYPE  DIMTYPE  =  1..DIM? 

R  VECTOR  =  ARRAY [DIMTYPE]  OF  REAL? 
TAYLOR  =  RECORD  LENGTH  :  DIMTYPE? 

T  :  REAL; 

TC  :  R VECTOR 


CHOICE  =  1 . .3 ; 


END; 


VAR  FLAG  :  CHOICE? 

I,  IM1  :  DIMTYPE? 

X,  Y,  YPRIME  :  TAYLOR? 

DATA  :  TEXT; 

FUNCTION  VRNULL  :  RVECTOR; 

VAR  I:  DIMTYPE?  U:  RVECTOR? 

BEGIN 

FOR  I  :=  1  TO  DIM  DO  U[I]  0.0? 
VRNULL  :=  U 

END?  (*  FUNCTION  VRNULL  *) 
FUNCTION  TSQR  (T;  TAYLOR)  :  TAYLOR; 

(*  Requires:  VRNULL,  SCALP,  SQR 

VAR  I,  J,  K,  HALF:  DIMTYPE? 

X,  Y:  RVECTOR? 

U  :  TAYLOR? 

BEGIN 

X  ;=  VRNULL;  Y  :=  VRNULL? 

U. LENGTH  :=  T. LENGTH; 

U.T  :=  T.T; 


U.TC  :=  VRNULL? 

U.TC[1]  :=  SQR  (T.TC[1]); 

X[1]  :=  T.TC [ 1 ] ? 

FOR  K  :=  2  TO  U. LENGTH  DO 
BEGIN 

X[K]  : »  T.TC  [K]  ? 

HALF  :=  K  DIV  2? 

FOR  J  1  TO  HALF  DO 
BEGIN 

I  K  —  J  +  1? 

Y[J]  T.TC [I] 

END;  (*  FOR  J  *) 

U.TC [K]  :=  2.0  *  SCALP  (  X,  Y,  0)? 

IF  K  MOD  2  =  1  THEN 
BEGIN 

HALF  :=  HALF  +  1? 

U.TC [K] U.TC [K]  +  SQR  (T.TC [HALF]) 
END  (*  IF  *) 

END;  (*  FOR  K  *) 

TSQR  :=  U 

END;  (*  FUNCTION  TSQR  (TAYLOR)  *) 


(*  TSQR(T)  *) 
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FUNCTION  MENU_C HO ICE  ;  CHOICE; 

VAR  I i  INTEGER; 

BEGIN 

WRITELN? 

WRITELN  (• ENTER;  1  -  GIVE  NEW  INITIAL  CONDITIONS’); 

WRITELN  (’  2  -  CONTINUE  EXTENDING  THE  SOLUTION'); 

WRITELN  (’  3  -  STOP'); 

READ  (I); 

IF  ((I  >=  3)  OR  (I  0))  THEN  I  :=  3; 

MENU_CHOICE  :=  I 

END;  (*  FUNCTION  MENU_CH0ICE  *) 

PROCEDURE  PRNT_TAY_COEF  (Y:  TAYLOR;  INDEX:  DIMTYPE); 

BEGIN 

WRITELN  ('Y(«,  INDEX : 5 ,  '  )  =  ',  Y.TC [INDEX]) 

END;  (*  PROCEDURE  PRNTJTAYJTOEF  *) 

FUNCTION  SUM  (VAR  A:  RVECTOR;  DIM,  ROUND:  INTEGER)  :  REAL? 

EXTERNAL  480; 

BEGIN  (*MAIN  PROGRAM  RDEQjSOLVE* ) 

(* .  INITIALIZE  *) 

FLAG  :=  2; 

X.  LENGTH  :=  DIM; 

Y.  LENGTH  :=  DIM; 

RESET  (DATA)? 

WHILE  FLAG  <—  2  DO  (*  LOOP  FOR  NEW  INITIAL  CONDITIONS  *) 

BEGIN 
FLAG  :=  2; 

X. TC  :=  VRNULL; 

Y. TC  :=  VRNULL? 

WRITELN  ('READ  REAL  INITIAL  CONDITIONS  X0,  Y(XO)t'); 

READ  (DATA,  X.TC[1])?  READ  (DATA,  Y.TC[1]); 

WRITELN ;  WRITELN ; 

WRITELN  (’INITIAL  CONDITIONS  AT  XO  =  ’ ,  X.TC[1],  ',')? 


WRITELN  ('  Y0  =  *,  Y.TC[1],  '.'); 

WHILE  FLAG  =  2  DO  (*  LOOP  FOR  ANALYTIC  CONTINUATION  *) 

BEGIN 

(* .  READ  STEP  SIZE  *) 


WRITELN  (’ENTER  STEPSIZE  X  -  X0:  '); 

READ  (X.T) ; 

Y.T  X.T; 

WRITELN  ('Computing  series  terms  ...*); 

FOP  I  :=  2  TO  DIM  DO  ( *  LOOP  FOR  SERIES  GENERATION  * ) 

BEGIN 

(*  YOUR  FIRST  ORDER  DIFFERENTIAL  EQUATION  GOES  HERE:  *) 

YPRIME  :=  TSQR  (Y); 

IMl  :=  1-1; 

Y . TC [ I ]  :=  YPRIME. TC[ IMl]  *  Y.T  /  IMl ; 

END;  ( *FOR* ) 

(* .  PPINT  TABLE  *) 

WRITELN;  WRITELN; 

WRITELN  ('THE  TAYLOR  COEFFICIENTS  OF  Y  ARE:'); 

FOR  I  :=  1  TO  DIM  DO  PRNT  TAY  COEF  (Y,  I); 


200 


PERFORM  THE  ANALYTIC  CONTINUATION  *) 


(* 


Y.TC[1]  SUM  (Y.TC,  DIM,  0) 
FOR  Is*  2  TO  DIM  DO  Y.TC [I]  s 
X.TC  [  1 ]  ;=  X.TC [ 1 ]  +  X.T; 
WRITELN; 

WRITELN  ('THE  VALUE  AT  X  = 
WRITELN  ( *  IS  Y  = 

FLAG  :=  MENUJCHOICE 
END  (  ♦WHILE*’ ) 

END  (*WHILE*) 

END.  (*  MAIN  PROGRAM  RDEQ  SOLVE  * ) 


t 

=  0.0; 

X • TC [ 1 ]  ) ; 
Y.TCI1], 
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APPENDIX  B 


PROGRAM  IDEQ_SOLVE  (INPUT,  DATA,  OUTPUT); 

(*  SOLVE  A  FIRST  ORDER  DIFFERENTIAL  EQUATION  *) 
(*  y  *  SQR  (Y)  *) 

(*  SOLUTION  IS  IN  INTERVAL  FORM  *) 


CONST  DIM  -  15; 

TYPE  DIMTYPE  -  1. .DIM; 

INTERVAL  =  RECORD  INF,  SUP  :  REAL  END; 
IVECTOR  -  ARRAY [DIMTYPE]  OF  INTERVAL; 


ITAYLOR  -  RECORD 


CHOICE  =  1 . . 3 ; 

VAR  FLAG 

I,  IMl 

X,  Y,  YPRIME 
DATA 
EPSILON, 
COMPOUND 


LENGTH  :  DIMTYPE; 

T  :  REAL; 

TC  :  IVECTOR  END ? 


:  CHOICE; 

:  DIMTYPE; 
:  ITAYLOR? 
;  TEXT? 

:  REAL? 


(*  Transfer  Functions  *) 

FUNCTION  INTPT  (  RA : REAL  )  i  INTERVAL; 

EXTERNAL  41; 

FUNCTION  INTVAL  (  RA , RB !  REAL  )  :  INTERVAL; 

EXTERNAL  42; 

FUNCTION  IINF  (  A:  INTERVAL)  ;  REAL; 

EXTERNAL  43? 

FUNCTION  ISUP  (  A:  INTERVAL)  :  REAL? 

EXTERNAL  44; 

(*  Comparisons  *) 

OPERATOR  <=  (A,B;  INTERVAL  )  RES:  BOOLEAN  ; 

EXTERNAL  48; 

OPERATOR  >=  (A,B:  INTERVAL  )  RES:  BOOLEAN  ; 

EXTERNAL  50; 

OPERATOR  IN  ( RA : REAL ;  B:  INTERVAL)  RES:  BOOLEAN? 
EXTERNAL  47? 

OPERATOR  IN  (KA:  INTEGER;  B:  INTERVAL)  RES:  BOOLEAN 
EXTERNAL  46; 

OPERATOR  ><  (A,B:  INTERVAL  )  RES:  BOOLEAN  ; 

EXTERNAL  52; 


(*  Lattice  Operators  *) 

OPERATOR  ++  (A,B:  INTERVAL)  RES:  INTERVAL; 
EXTERNAL  63; 

OPERATOR  **  (A,B :  INTERVAL)  RES:  INTERVAL? 
EXTERNAL  60; 
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(*  Arithmetic  Operators  *) 

OPERATOR  +  <  A;  INTERVAL  )  RES:  INTERVAL; 

EXTERNAL  67  ? 

OPERATOR  -  (  A:  INTERVAL  )  RES:  INTERVAL; 

EXTERNAL  66; 

OPERATOR  +  (  A,B:  INTERVAL  )  RES:  INTERVAL ; 

EXTERNAL  68; 

OPERATOR  +  (  KA:  INTEGER;  B:  INTERVAL  )  RES:  INTERVAL; 
EXTERNAL  69; 

OPERATOR  +  (  A:  INTERVAL;  KB:  INTEGER  )  RES:  INTERVAL; 
EXTERNAL  70; 

OPERATOR  -  (  A,B:  INTERVAL  )  RES:  INTERVAL; 

EXTERNAL  73; 

OPERATOR  -  (  KA:  INTEGER;  B:  INTERVAL  )  RES:  INTERVAL; 
EXTERNAL  75; 

OPERATOR  -  (  A:  INTERVAL;  KB:  INTEGER  )  RES:  INTERVAL ; 
EXTERNAL  74; 

OPERATOR  *  (  A ,B :  INTERVAL  )  RES:  INTERVAL; 

EXTERNAL  78? 

OPERATOR  *  (  KA;  INTEGER;  B:  INTERVAL  )  RES:  INTERVAL; 
EXTERNAL  79; 

OPERATOR  *  (  A:  INTERVAL;  KB:  INTEGER  )  RES:  INTERVAL; 
EXTERNAL  80; 

OPERATOR  /  (  A,B:  INTERVAL  )  RES:  INTERVAL; 

EXTERNAL  85; 

OPERATOR  /  (  KA:  INTEGER;  B:  INTERVAL  )  RES:  INTERVAL; 
EXTERNAL  83; 

OPERATOR  /  (  A:  INTERVAL;  KB:  INTEGER  )  RES:  INTERVAL; 
EXTERNAL  86; 

FUNCTION  ISCALP  (VAR  A,  B:  IVECTOR;  AKDIM  :  INTEGER)  : 
EXTERNAL  88? 


(*  Standard  Functions  * ) 

FUNCTION  IABS  (  Y:  INTERVAL  )  :  REAL; 
EXTERNAL  101; 

FUNCTION  ISQR  (  Y:  INTERVAL  )  ;  INTERVAL; 
EXTERNAL  102; 

FUNCTION  ISQRT  (  Y:  INTERVAL  )  :  INTERVAL; 
EXTERNAL  105; 

FUNCTION  I EXP  (  Y:  INTERVAL  )  :  INTERVAL ; 
EXTERNAL  106; 

FUNCTION  ILN  (  Y:  INTERVAL  )  :  INTERVAL? 
EXTERNAL  107? 

FUNCTION  IARCTAN  (  Y:  INTERVAL  )  :  INTERVAL? 
EXTERNAL  108; 

FUNCTION  ISIN  (  Y:  INTERVAL  )  :  INTERVAL? 
EXTERNAL  109; 

FUNCTION  ICOS  (  Y:  INTERVAL  )  :  INTERVAL? 
EXTERNAL  110; 


INTERVAL? 
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(*  Input  /  Output  *) 

PROCEDURE  I READ  {  VAR  F:TEXT?  VAR  A:  INTERVAL  ); 

EXTERNAL  92? 

PROCEDURE  I WRITE  (  VAR  Fi  TEXT;  A:  INTERVAL  ) ? 

EXTERNAL  91; 

FUNCTION  ISUM  (A:  IVECTOR?  DIM:  DIMTYPE)  :  INTERVAL? 

VAR  Bi  IVECTOR? 

I:  DIMTYPE; 

BEGIN 

FOR  I  : —  1  TO  DIM  DO  B[I]  :=  INTPT  (1.0); 

ISUM  :=  ISCALP  (A,  B,  DIM) 

END;  (*  FUNCTION  ISUM  * ) 

FUNCTION  IVRNULL  :  IVECTOR? 

VAR  I:  DIMTYPE?  U:  IVECTOR; 

BEGIN 

FOR  I  :*  1  TO  DIM  DO  U[I]  :=  INTPT  (0.0); 

IVRNULL  :=  U 

END?  (*  FUNCTION  IVRNULL  *) 

FUNCTION  ITSQR  (T:  ITAYLOR)  :  ITAYLOR;  (*  ITSQR(IT)  *) 

(*  Requires:  IVRNULL,  ISCALP,  ISQR  *) 

VAR  I,  J,  K,  HALF:  DIMTYPE? 

X,  Y:  IVECTOR; 

U  :  ITAYLOR; 

BEGIN 

X  :=  IVRNULL;  Y  :=  IVRNULL? 

U. LENGTH  :  =  T, LENGTH? 

U.T  :  =  T.T? 

U.TC  :=  IVRNULL? 

U.TC[1]  :=  ISQR  (T.TC  [1]  ) ; 

X[1]  T.TC  [  1 ] ? 

FOR  K  2  TO  U. LENGTH  DO 
BEGIN 

X[K]  :=  T.TC  [K]  ? 

HALF  K  DIV  2; 

FOR  J  :=  1  TO  HALF  DO 
BEGIN 

I  :=  K  -  J  +  1? 

Y [ J]  :=  T.TC  [I] 

END?  (*  FOR  J  * ) 

U.TC [K]  :=  2  *  ISCALP  (  X,  Y,  HALF); 

IF  K  MOD  2  =  1  THEN 
BEGIN 

HALF  :=  HALF  +  1? 

U.TC [K]  :=  U.TC [K]  +  ISQR  (T.TC [HALF]) 

END  (*  IF  *) 

END;  (*  FOR  K  *) 

ITSQR  U 

END?  (*  FUNCTION  ITSQR  (ITAYLOR)  * ) 
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FUNCTION  MENU  CHOICE  :  CHOICE? 


VAR  I:  INTEGER; 
BEGIN 

WRITELN? 

WRITELN  ( 'ENTER: 

1 

-  GIVE  NEW  INITIAL  CONDITIONS'); 

WRITELN  ( ' 

2 

-  CONTINUE  EXTENDING  THE  SOLUTION* ) 

WRITELN  ( 1 

READ  (I); 

3 

-  STOP')? 

H 

*1 

H 

V 

It 

GJ 

o 

SG 

(I  < 

=  0) )  THEN  I  :=  3; 

MENU_CHOICE  I 

END?  (*  FUNCTION  MENU  CHOICE  *) 


PROCEDURE  WRITE_INTERVAL  (INT:  INTERVAL); 

BEGIN 

WRITE  ('[’,  INT* INF: 12,  ' ,  ' ,  INT. SUP: 12,  ']')? 
END;  (*  PROCEDURE  WRITE  INTERVAL  *) 


PROCEDURE  PRNT_ITAY_COEF  (Y:  ITAYLOR;  INDEX:  DIMTYPE); 
BEGIN 

WRITE  CYC,  INDEX :5,  '  )  =  ')? 

WRITE_INTERVAL  ( Y. TC  [INDEX]  ) ? 

WRITELN 

END?  (*  PROCEDURE  PRNT  ITAY  COEF  *) 


FUNCTION  INTERVAL_LENGTH  (INT:  INTERVAL)  :  REAL? 
BEGIN 

INTERVAL_LENGTH  INT. SUP  -  INT. INF  * 

END;  (*  FUNCTION  INTERVAL  LENGTH  *) 


FUNCTION  RELATIVE_LENGTH  (INT:  INTERVAL)  :  REAL? 
BEGIN 

RELATIVE^! ENGTH  :=  2.0  *  (INT. SUP  -  INT. INF) 

/  (INT. SUP  +  INT. INF) 
END?  (*  FUNCTION  RELATIVE  LENGTH  *) 


FUNCTION  RELATIVE_ERROR  (INT:  INTERVAL)  :  REAL? 

BEGIN 

RELATIVE__ERROR  :=  2.0  *  INT. SUP  /  (INT. SUP  +  INT. INF) 
END?  (*  FUNCTION  RELATIVE  ERROR  *) 


BEGIN  (*  MAIN  PROGRAM  IDEQ_SOLVE  *) 

(* .  INITIALIZE  *) 

FLAG  :=  2; 

X.  LENGTH  :=  DIM? 

Y.  LENGTH  :=  DIM; 

RESET  (DATA); 
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(*  LOOP  FOR  NEW  INITIAL  CONDITIONS  *> 


WHILE  FLAG  <-  2  DO 
BEGIN 
FLAG  2; 


X. TC  :=  IVRNULL; 

Y. TC  ;  =  IVRNULL; 

WRITELN  ('READ  INTERVAL  INITIAL  CONDITIONS  XO ,  Y(X0):'); 

IREAD  (DATA,  X.TC[1])j  IREAD  (DATA,  Y.TC[1])? 

WRITELN?  WRITELN; 

WRITE  ( ' INITIAL  CONDITIONS  AT  XO  =  1 ) ? 

WRITE__INTERVAL  ( X . TC  [  1 ]  )  ?  WRITELN  (',')? 

WRITE  ( '  YO  —  *  ) ; 

WRITE_INTERVAL  (Y.TC [1 ] ) ?  WRITELN  CO; 

WHILE  FLAG  ”  2  DO  (*  LOOP  FOR  ANALYTIC  CONTINUATION 


*) 


BEGIN 

(* .  READ  STEP  SIZE  *) 

WRITELN  CENTER  STEPSIZE  X  -  XO:  *  )? 

READ  (X.T) ; 

Y.T  :=  X.T; 

WRITELN  (’Computing  series  terms  ...’)? 

FOR  I  :=  2  TO  DIM  DO  (*  LOOP  FOR  SERIES  GENERATION  *) 

BEGIN 

(*  YOUR  FIRST  ORDER  DIFFERENTIAL  EQUATION  GOES  HERE:  *) 

YPRIME  :=  ITSQR  (Y); 


IM1  :  =  I  -  1  ? 

Y.TC  [I]  YPRIME ,TC [IMl ]  *  INTPT  (Y.T  /  IM1  )  ; 
END ;  ( *FOF* ) 


(* .  PRINT  TABLE  *) 

EPSILON  :=  0.5  *  RELATIVE_LENGTH  ( Y.TC  1 1 ]  ) ? 

COMPOUND  :=  1.0; 

WRITELN  ?  WRITELN ; 

WRITELN  (’Step  Left  Right  Computed 

Theoretical* ) ? 

WRITELN  ( '  Endpoint  Endpoint  Instability 

Instability* ) ; 

WRITELN? 

FOR  I  :=  1  TO  DIM  DO  (*  LOOP  FOR  ERROR  MEASUREMENT  *) 

BEGIN 

COMPOUND  :=  COMPOUND  *  (1.0  +  EPSILON); 

WRITE  (1:3);  WRITE  (*  *)? 

WRITE^INTERVAL  ( Y . TC [ I ]  ) ; 

WRITE*” C  ',  RELATIVE_ERROR  ( Y.TC  [I]  )  :  10  )  ; 

WRITE  ( '  ' ,  COMPOUND : 1 0 ) ;  WRITELN 

END  ?  ( *FOF* ) 

( * .  PERFORM  THE  ANALYTIC  CONTINUATION  * ) 

Y.TC  [  1 ]  :=  ISUM  (Y.TC,  DIM); 

FOR  I  :~2  TO  DIM  DO  Y.TC [I]  INTPT  (0.0); 

X.TC [ 1 ]  :=  X.TC [ 1 1  +  INTPT  (X.T); 

WRITELN; 

WRITE  (  *THE  VALUE  AT  X  -  * ) ?  WRlTEJtNTERVAL  (X.TC  [1]) I  WRITELN; 
WRITE  ('  IS  Y  =  ');  WRITE_INTERVAL  (Y.TC[1])j  WRITELN  ( * . 

FLAG  :=  MENUJCHOICE 
END  ( *WHILE* ) 

END  ( *WHILE* ) 

END.  (*MAIN  PROGRAM  IDEQ  SOLVE*) 
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APPENDIX  C 


PASCAL-SC  REAL  AND  INTERVAL  TAYLOR  OPERATORS,  PROCEDURES,  AND  FUNCTIONS 

The  operators,  procedures,  and  functions  are  grouped  into  seven  files.  Source 
code  can  be  found  in  the  report  [7] . 


1.  RIT_ADD.LIB 

2.  RIT_MUL . LIB 

3.  RIT_D IV .LIB 

4.  RT_POW.LIB 

5.  ITJ POW.LIB 

6.  RIT_FNS . LIB 

7.  UTIL. LIB 


REAL  AND  INTERVAL  TAYLOR  ADD  AND  SUBTRACT  <<<<<< 
REAL  AND  INTERVAL  TAYLOR  MULTIPLY  <<<<<<<<<<<<< 
REAL  AND  INTERVAL  TAYLOR  DIVIDE  <<<<<<<<<<<<<<< 
REAL  TAYLOR  POWERS  <<<<<<<<<<<<<<<<<<<<<<<<< 
INTERVAL  TAYLOR  POWERS  <<<<<<<<<<<<<<<<<<<<<< 
REAL  AND  INTERVAL  TAYLOR  FUNCTIONS  <<<<<<<<<<<< 
UTILITY  PROCEDURES  &  FUNCTIONS  <<<<<<<<<<<<<<<< 


The  contents  of  each  library  are: 


C.1«  Addition  and  Subtraction  Operators 


RIT_ADD .LIB  -  REAL  AND  INTERVAL  TAYLOR  ADD  AND  SUBTRACT  <<<<<< 
+  T  OPERATOR  +  (T:  TAYLOR)  RES  :  TAYLOR; 

K  +  T  OPERATOR  +  (K:  INTEGER;  T:  TAYLOR)  RES  ;  TAYLOR; 

T  +  K  OPERATOR  +  (T:  TAYLOR;  K:  INTEGER)  RES  :  TAYLOR ; 

R  +  T  OPERATOR  +  (R:  REAL;  T:  TAYLOR)  RES  :  TAYLOR; 

T  +  R  OPERATOR  +  (T:  TAYLOR?  R;  REAL)  RES  :  TAYLOR? 

T  +  T  OPERATOR  +  (TA,  TB:  TAYLOR)  RES  ;  TAYLOR; 

+  IT  OPERATOR  +  (T:  ITAYLOR)  RES  :  ITAYLOR; 

K  +  IT  OPERATOR  +  (K:  INTEGER?  T:  ITAYLOR)  RES  :  ITAYLOR? 

IT  +  K  OPERATOR  +  (T:  ITAYLOR?  K:  INTEGER)  RES  i  ITAYLOR; 

I  +  IT  OPERATOR  +  (K:  INTERVAL?  T:  ITAYLOR)  RES  :  ITAYLOR? 

IT  +  I  OPERATOR  +  (T:  ITAYLOR;  K:  INTERVAL)  RES  :  ITAYLOR? 

IT  +  IT  OPERATOR  +  (TA,  TB ;  ITAYLOR)  RES  :  ITAYLOR; 


-  T 

K  -  T 
T  -  K 
R  -  T 
T  -  R 
T  -  T 

-  IT 

K  -  IT 
IT  '  K 
I  -  IT 
IT  -  I 
IT  -  IT 


OPERATOR 

OPERATOR 

OPERATOR 

OPERATOR 

OPERATOR 

OPERATOR 

OPERATOR 

OPERATOR 

OPERATOR 

OPERATOR 

OPERATOR 

OPERATOR 


(T:  TAYLOR)  RES  :  TAYLOR? 

(K:  INTEGER;  T:  TAYLOR)  RES  ;  TAYLOR; 

(T:  TAYLOR;  K:  INTEGER)  RES  :  TAYLOR ; 

(R:  REAL?  T:  TAYLOR)  RES  :  TAYLOR; 

(T:  TAYLOR?  R:  REAL)  RES  :  TAYLOR; 

(TA,  TB:  TAYLOR)  RES  :  TAYLOR? 

(T:  ITAYLOR)  RES  :  ITAYLOR? 

(K:  INTEGER;  T:  ITAYLOR)  RES  :  ITAYLOR? 
(T:  ITAYLOR;  K:  INTEGER)  RES  :  ITAYLOR ? 
(K:  INTERVAL;  T:  ITAYLOR)  RES  i  ITAYLOR; 
(T:  ITAYLOR;  K:  INTERVAL)  RES  :  ITAYLOR? 
(TA,  TB;  ITAYLOR)  RES  :  ITAYLOR; 


C.2.  Multiplication  Operators  (Including  TSQR  and  ITSQR) 


RIT_MUL.LIB 
K  *  T 
T  *  K 
R  *  T 
T  *  R 
T  *  T 
TSQR(T) 


REAL  AND  INTERVAL  TAYLOR  MULTIPLY  <<<<<<<<<<<<< 

OPERATOR  *  (K:  INTEGER?  T:  TAYLOR)  RES  :  TAYLOR; 
OPERATOR  *  (T:  TAYLOR;  K:  INTEGER)  RES  :  TAYLOR; 
OPERATOR  *  (R:  REAL;  T;  TAYLOR)  RES  :  TAYLOR? 
OPERATOR  *  (T:  TAYLOR?  R:  REAL)  RES  :  TAYLOR; 
OPERATOR  *  (TA,  TB :  TAYLOR)  RES  :  TAYLOR; 

FUNCTION  TSQR  (T:  TAYLOR)  :  TAYLOR; 
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C.2.  Multiplication  Operators  (Continued) 


K  *  IT 
IT  *  K 
I  *  IT 
IT  *  I 
IT  *  IT 
ITSQR(IT) 


OPERATOR  *  (K:  INTEGER;  T:  ITAYLOR)  RES  :  ITAYLOR; 

OPERATOR  *  (T:  ITAYLOR?  K:  INTEGER)  RES  :  ITAYLOR; 

OPERATOR  *  (K:  INTERVAL;  T:  ITAYLOR)  RES  :  ITAYLOR; 

OPERATOR  *  (T:  ITAYLOR;  K:  INTERVAL)  RES  :  ITAYLOR? 

OPERATOR  *  (TA,  TB:  ITAYLOR)  RES  s  ITAYLOR? 

FUNCTION  ITSQR  (T:  ITAYLOR)  :  ITAYLOR? 


C«3.  Division  Operators* 


RIT_DIV.LIB 
K  /  T 
T  /  K 
R  /  T 
T  /  R 
T  /  T 
K  /  IT 
IT  /  K 
I  /  IT 
IT  /  I 
IT  /  IT 


REAL  AND  INTERVAL  TAYLOR  DIVIDE  <<<<<<<<<<<<<<< 

OPERATOR  /  (K:  INTEGER?  T:  TAYLOR)  RES  :  TAYLOR; 
OPERATOR  /  (T:  TAYLOR?  K;  INTEGER)  RES  i  TAYLOR; 
OPERATOR  /  (R:  REAL?  T:  TAYLOR)  RES  :  TAYLOR? 
OPERATOR  /  (T:  TAYLOR;  R:  REAL)  RES  :  TAYLOR; 
OPERATOR  /  (TA,  TB:  TAYLOR)  RES  :  TAYLOR? 

OPERATOR  /  (K:  INTEGER;  T:  ITAYLOR)  RES  :  ITAYLOR? 
OPERATOR  /  (T:  ITAYLOR;  K:  INTEGER)  RES  :  ITAYLOR? 
OPERATOR  /  (K:  INTERVAL?  T:  ITAYLOR)  RES  :  ITAYLOR? 
OPERATOR  /  (T:  ITAYLOR?  K:  INTERVAL)  RES  :  ITAYLOR; 
OPERATOR  /  (TA,  TB :  ITAYLOR)  RES  :  ITAYLOR; 


C.4.  Real  Power  Operators  and  Functions* 


RT_POW  *  LIB 
TSQR(T) 
TSQRT(T) 
TEXP(T) 


REAL  TAYLOR  POWERS  <<<<<<<<<<<<<<<<<<<<<<<<< 

FUNCTION  TSQR  (T:  TAYLOR)  :  TAYLOR; 

FUNCTION  TSQRT  (T:  TAYLOR)  :  TAYLOR? 

FUNCTION  TEXP  (T:  TAYLOR)  :  TAYLOR? 


K  **  K 
R  **  K 
R  **  R 
K  **  R 
T  **  K 
T  **  R 
R  **  T 
K  **  T 

<P  ★*  fp 


OPERATOR  **  (BASE,  EXPONENT  :  INTEGER)  RES  :  INTEGER; 
OPERATOR  **  (BASE:  REAL;  EXPONENT:  INTEGER)  RES  :  REAL; 
OPERATOR  **  (BASE,  EXPONENT:  REAL)  RES  :  REAL; 

OPERATOR  **  (BASE:  INTEGER;  EXPONENT:  REAL)  RES  :  REAL; 
OPERATOR  **  (BASE:  TAYLOR?  EXPONENT:  INTEGER)  RES  :  TAYLOR; 
OPERATOR  **  (BASE:  TAYLOR;  EXPONENT:  REAL)  RES  :  TAYLOR; 
OPERATOR  **  (BASE:  REAL?  EXPONENT:  TAYLOR)  RES  :  TAYLOR? 
OPERATOR  **  (BASE:  INTEGER;  EXPONENT:  TAYLOR)  RES  :  TAYLOR? 
OPERATOR  **  (BASE,  EXPONENT:  TAYLOR)  RES  :  TAYLOR; 


C.5.  Interval  Power  Operators  and  Functions. 


POW .LIB 

INTERVAL 

TAYLOR  POWERS 

<<<<<<<<<<<<<<<<<<<<<< 

”  ITSPR(IT) 

FUNCTION 

ITSQR  (T: 

ITAYLOR) 

:  ITAYLOR; 

ITSQR (IT) 

FUNCTION 

ITSQR  (T: 

ITAYLOR) 

:  ITAYLOR? 

ITSQRT(IT) 

FUNCTION 

ITSQRT  (T; 

:  ITAYLOR) 

:  ITAYLOR; 

ITEXP(IT) 

FUNCTION 

ITEXP  (T: 

ITAYLOR ) 

:  ITAYLOR; 

I  **  K 

OPERATOR 

** 

(BASE: 

INTERVAL? 

EXPONENT:  INTEGER) 

RES 

:  INTERVAL? 

K  **  I 

OPERATOR 

** 

(BASE: 

INTEGER; 

EXPONENT:  INTERVAL) 

RES 

:  INTERVAL; 

I  **  I 

OPERATOR 

** 

(BASE: 

interval  ? 

EXPONENT:  INTERVAL)  RES 

:  INTERVAL 

IT  **  K 

OPERATOR 

*  it 

(BASE: 

ITAYLOR; 

EXPONENT:  INTEGER) 

RES  : 

ITAYLOR? 

IT  **  I 

OPERATOR 

** 

(BASE: 

ITAYLOR? 

EXPONENT:  INTERVAL) 

RES 

:  ITAYLOR? 

K  **  IT 

OPERATOR 

** 

(BASE: 

INTEGER? 

EXPONENT:  ITAYLOR) 

RES  : 

ITAYLOR? 

I  **  IT 

OPERATOR 

** 

(BASE: 

INTERVAL; 

EXPONENT :  ITAYLOR ) 

RES 

:  ITAYLOR; 

IT  **  IT 

OPERATOR 

** 

(BASE: 

ITAYLOR; 

EXPONENT:  ITAYLOR) 

RES  : 

ITAYLOR; 
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C.6.  Real  and  Interval  Functions  and  Procedures 


RIT_FNS.LIB  - 
TSQR(T) 
TSQRT(T) 
TEXP(T) 
TLN(T) 
TJ5INJC0S 
TSIN(T) 
TCOS(T) 
TRUNGE(T) 
TARCTAN ( T ) 
TTAN(T) 
TDIFF(T) 
TINTGRL ( T ) 


REAL  AND  INTERVAL  TAYLOR  FUNCTIONS 
FUNCTION  TSQR  (Ti  TAYLOR)  :  TAYLOR? 
FUNCTION  TSQRT  (T:  TAYLOR)  !  TAYLOR; 
FUNCTION  TEXP  (Ti  TAYLOR)  :  TAYLOR? 
FUNCTION  TLN  (T:  TAYLOR)  i  TAYLOR ; 
PROCEDURE  T_SINJ=OS  (Ts  TAYLOR?  VAR  S, 
FUNCTION  TsTn  (T:  TAYLOR)  i  TAYLOR? 
FUNCTION  TCOS  (Ti  TAYLOR)  :  TAYLOR; 
FUNCTION  TRUNGE  (Ti  TAYLOR)  :  TAYLOR; 
FUNCTION  TARCTAN  (T;  TAYLOR)  :  TAYLOR? 
FUNCTION  TTAN  (T;  TAYLOR)  :  TAYLOR? 
FUNCTION  TDIFF  (T:  TAYLOR)  :  TAYLOR? 
FUNCTION  TINTGRL  (T:  TAYLOR)  :  TAYLOR; 


<<<<<<<<<<<< 


C:  TAYLOR); 


ITSQR(IT) 
ITSQRT ( IT ) 
ITEXP(IT) 
ITLN(IT) 
IT_SIN_C0S 
ITSIN(IT) 
ITCOS(IT) 
ITRUNGE ( IT ) 
ITARCTAN ( IT ) 
ITTAN ( IT) 
ITDIFF(T) 
ITINTGRL(T) 


FUNCTION  ITSQR  (T;  ITAYLOR)  :  ITAYLOR? 
FUNCTION  ITSQRT  (T:  ITAYLOR)  :  ITAYLOR? 
FUNCTION  ITEXP  (T:  ITAYLOR)  s  ITAYLOR; 
FUNCTION  I TLN  <T;  ITAYLOR)  :  ITAYLOR; 


PROCEDURE  IT_SIN_COS  (T;  ITAYLOR;  VAR  S,  C:  ITAYLOR); 
FUNCTION  ITSIN  (T:  ITAYLOR)  :  ITAYLOR; 

FUNCTION  I TCOS  (T;  ITAYLOR)  i  ITAYLOR? 

FUNCTION  ITRUNGE  (T:  ITAYLOR)  :  ITAYLOR? 

FUNCTION  ITARCTAN  (Ts  ITAYLOR)  :  ITAYLOR? 

FUNCTION  ITTAN  (T;  ITAYLOR)  :  ITAYLOR? 

FUNCTION  ITDIFF  (T:  ITAYLOR)  :  ITAYLOR? 

FUNCTION  ITINTGRL  (T:  ITAYLOR)  :  ITAYLOR; 


C . 7.  Utilities* 


UTIL. LIB  -  UTILITY  PROCEDURES  &  FUNCTIONS  <<<<<<<<<<<<<<<< 

FUNCTION  VRNULL  ;  R VECTOR? 

FUNCTION  I VRNULL  :  IVECTOR? 

FUNCTION  T_IDENT_ZERO  (T:  TAYLOR)  :  BOOLEAN? 

FUNCTION  T_IDENT__C0NSTANT  (T:  TAYLOR)  ;  BOOLEAN; 

FUNCTION  IT_IDENT_ZERO  (Ti  ITAYLOR)  i  BOOLEAN? 

FUNCTION  I T_ID ENTjSON ST ANT  (T:  ITAYLOR)  :  BOOLEAN? 

PROCEDURE  WRITE_INTERVAL  (INTi  INTERVAL); 

PROCEDURE  WRITE_INTERVAL_SERIES  (SER;  ITAYLOR); 

PROCEDURE  READ_INTERVAL_SERIES  (VAR  F:  ITAYLOR); 

PROCEDURE  WRITE_SERIES  (Ti  TAYLOR)? 

FUNCTION  ITMIDPT  (F  i  ITAYLOR)  :  TAYLOR? 
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ON  THE  EXTREMUM  OF  BILINEAR  FUNCTIONAL 
FOR  HYPERBOLIC  TYPE  P.D.E. 

C.  N,  Shen 

U.S.  Army  Armament,  Munitions,  and  Chemical  Command 
Armament  Research  and  Development  Center 
Large  Caliber  Weapon  Systems  Laboratory 
Benet  Weapons  Laboratory 
Watervliet,  NY  12189 


ABSTRACT .  Transient  solutions  of  the  hyperbolic  type  partial 
differential  equations  are  needed  for  solving  many  engineering  problems  such 
as  computing  stress  waves  for  gun  dynamics  or  determining  shock  behaviors  of 
penetration  mechanics. 

Variational  procedures  using  the  bilinear  formulations  with  adjoint 
variables  can  serve  as  the  theoretical  basis  in  the  derivation  of  algorithms 
for  the  finite  element  methods,  giving  direct  numerical  solutions  for  partial 
derivatives  of  the  functions  to  be  found  for  these  problems.  The  adjoint 
system  can  be  arranged  in  a  manner  that  it  is  a  reflected  mirror  of  the 
original  system  in  time.  GeneraliEed  boundary  conditions  employ  many  types  of 
"springs"  relating  the  various  spatial  partial  derivatives.  They  are  defined 
to  satisfy  the  boundaries  of  the  concomitant  for  the  bilinear  expression. 
Algorithms  for  use  in  the  finite  element  method  are  simplified  since  the 
adjoint  system  gives  exactly  the  same  solutions  as  that  of  the  original 
system.  The  second  necessary  condition  for  an  extremum  is  satisfied  by 
showing  that  the  second  variation  is  positive  semi-def inite, 

I*  INTRODUCTION,  Transient  solutions  of  the  hyperbolic  type  partial 
differential  equation,  for  example  the  wave  equation  or  the  beam  equation,  are 
important  for  solving  engineering  problems  such  as  stress  wave  for  gun 
dynamics  or  shock  behavior  of  penetration  mechanics.  At  present  these 
equations  are  usually  solved  numerically  by  the  finite  difference  method  or  by 
the  Galerkin  method.  Considerable  advantage  may  be  obtained  if  the  finite 
element  method  can  be  directly  employed  instead.  Variational  procedures  using 
bilinear  formulation  with  adjoint  variables  can  serve  as  the  theoretical  basis 
for  the  derivation  of  algorithms  using  the  finite  element  method  for 
hyperbolic  type  p.d.e, 

II.  THE  VARIATIONAL  PRINCIPLE.  A  dynamical  system  can  be  modeled  by  the 
following  partial  differential  equation. 

LU)  y(  C)  -  -QU)  (1) 

with  appropriate  boundary  and  initial  conditions.  In  the  above  equation  L  is 
a  linear  operator  in  both  spatial  and  temporal  domain,  y  is  the  dependent 
variable,  Q  is  a  forcing  function,  and  £  represents  all  independent  variables, 
both  spatial  and  temporal. 

The  inner  produce  <  >  of  an  adjoint  forcing  function  Q  and  the  solution 
(y(O)  of  Eq^  (1)  can  be  used  for  the  purpose  of  estimation.  This  inner 
product  is  <Q,y>. 
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An  accurate  estimation  can  be  made  by  constructing  a  variational 
principle  [1],  By  using  the  adjoint  variable  y  as  a  Lagrange  multiply  for 
Eq.  (1)  adding  to  <Q,y>,  we  have 

Jlty,y]  =  <Q»y>  +  <y,(Q+*Ly)>  -  <Q,y>  +  <yfQ>  +  <y»Ly>  (2) 

To  keep  the  system  symmetrical,  let  us  define  the  adjoint  system  as 

L(OyU)  *  -QU)  (3) 

By  using  the  original  variable  y  as  a  Lagrange  multiply  for  Eq.  (3)  adding  to 
<Q,y>,  we  have 

J2[y»yJ  s  <Q*y>  +  <y»(Q+-Ly)>  ■  <Q»y>  +  <y»Q>  +  <y*^y>  (4) 

By  definition,  the  relationship  of  the  adjoint  system  to  the  original  system 
is 

A  -  — 

D  =  <y,Ly>  -  <y,Ly>  =  0  (5) 

where  D  is  the  bilinear  concomitant  [1].  Combining  Eqs.  (2),  (4),  and  (5)  one 
obtains 


Jl  =  J2  (6a) 

In  order  to  keep  the  functional  symmetrical,  we  have 

A  1 

-J  -  -  [Ji  +  J2]  (6b) 

2 

which  is  of  the  form 


J  -  <Q,y>  +  <y,Q>  +  -  <y,Ly>  +  -  <y,Ly> 


(6c) 


To  show  that  the  above  functional  satisfies  both  the  original  and  the 
adjoint  systems,  let  us  take  the  first  variations  of  Eqs.  (5)  and  (6)  which 
gives 

<SJ  =•  <$J(5y)  +  6J(<Sy)  (7a) 


where 


and 


Also 


1  1 

5 J(6y)  =  <<5y,Q>  +  -  <6y,Ly>  +  -  <y,L5y>  =  0 


1  —  1  - 

<5j(3y)  -  <6y,Q>  +  -  <5y,Ly>  +  -  <y,L<5y>  «  0 


(7b) 

(7c) 


SD  =  5D(6y)  +  6D(6y) 


(8a) 


where 


SD(<Sy)  =  <<5y,Ly>  -  <y,L<5y>  “  0  (8b) 

and 

5D(Sy)  =  -  <5y ,Ly>  +  <y,L6y>  =  0  (8c) 

From  Eqs*  (7b)  and  (8b)  we  obtained 

l  -  1  - 

6 J(<5y)  =  <6y,Q)  +  -  <6y,Ly>  +  -  <5y,Ly>  =»  <6y,(q+Ly)>  -  0  (9) 

For  arbitrary  <$y  satisfying  certain  general  limitations  on  the  boundaries  it 
can  be  shown  that  the  Euler-Lagrange  Equation  for  the  original  system  in  Eq. 
(1)  is  satisfied-  From  Eqs.  (7c)  and  (8c)  we  get 

<5J(<$y)  =  <Sy ,Q>  -  -  <6y,Ly>  +  -  <6y,Ly>  =  (5y»(Q+Ly)>  -  0  (10) 

For  arbitrary  variation  5y,  the  Euler-Lagrange  Equation  for  the  adjoint  system 
in  Eq*  (3)  is  also  satisfied. 

Ill*  INTEGRAL  OF  BILINEAR  EXPRESSION*  The  integral  of  a  bilinear 
expression  for  a  two  dimensional  problem  having  second  order  partial 
derivatives  in  time  and  fourth  order  partial  derivatives  In  space  can  be 
written  as 

Xb  tb 

I  “  /  J  G[y(x,t) ,y(x,t) ]dtdx  (11) 

*0  z0 

where  G(y,y]  is  a  given  bilinear  expression  in  the  form 

«[y,y]  -  ytyt  -  “2yy  -  a2yxyx  -  b2yxxyxx  (12) 

The  subscripts  t  and  x  indicate  the  partial  derivatives  for  the  functions  y 
and  y- 

Equation  (12)  can  be  integrated  by  parts*  Two  different  forms  of 
integration  and  end  conditions  are  given.  The  first  form  of  the  Integral  is 
obtained  by  integrating  by  parts  on  the  adjoint  variable. 

tb  xb  _  xb  -  tb 

l\  s  “J  J  yLydtdx  +  J  yty|  dx 

t0  Xo  tQ 

tb  -  xb  -  xb  -  xb 

+  /  t-i>2yxxyxl  +  0>2yxx)xy|  ~  *2yxy|  >dt  (i3a) 

tQ  Xq  Xq  xq 
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On  the  other  hand,  we  can  perform  integration  on  the  original  variable  to  give 


cb  xb  —  ,xb  -  tb 

i2  ~  J  yLydtdx  +  /  yty|  <*x 

to  xo  Xo  t0 

tb  -  xb  *b  xb 

+  Ji  t-b2yxxyx|  +  b2(yxx)xy|  -  a2yxy|  }dt  (13b) 

t  Q  Xq  XQ  Kq 

To  keep  the  form  symmetrical,  we  take  the  average  of  the  above  two  expressions 

1  1  *b  Cb  1  -  —  1  xb  -  -  tb 

I  -  -  II  +  -  12  -  -/  J  -(yLy+yLyMcdx  +  -  J  (yty+yty)  I  dx 

2  2  x0  t0  2  2  xa  tQ 

1  tb  ,  -  -  Xb 

+  -  f  (-a2)(yxy+yxy)|  dt 

2  t0  x0 

1  tb  -  -  Xb  1  tb  -  -  Xb 

+  -  /  (-b2)(yxxyx+yxxyx) |  it  -  -  J  [(-b2yxx)xy  +  (-b2yxx)xy)  dt  (14) 

2  tQ  x0  2  t0  xG 


nUWL V  .  ^  _ 

by  =  ytt  +  “  y  -  ayXX  +  bzyxxxx  (15a) 

and  —  -  -  -  - 

by  =  ytt  +  u2y  -  a2yxx  +  b2yxxxx  (15b) 

For  a  fourth  order  spatial  partial  and  a  second  order  temporal  partial  system 
Eq.  (5)  becomes 

xb  tb  -  xb  tb  — 

D  =  /  /  yLydtdx  -  /  /  yLydtdx  (16a) 

xo  to  xo  t0 

By  equating  Eqs.  (l 3a)  and  (13b)  and  solving  for  D  in  Eq.  (16a)  we  are 
converting  the  double  integral  into  single  integrals  in  terms  of  the  boundary 
conditions. 

We  can  express  the  quantity  D  as  the  sum  of  three  parts  for  end 
conditions  ,  D2,  and  D3  as 


D  =  Dx  +  D2  +  D3  (16b) 

The  terms  in  involve  the  initial  conditions  of  y  and  y  as 

xb  -  tb  tb 

Di  =  /  (yty|  *  ytyl  >d* 

x0  to  t0 
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.  *b  - 

D1  “  /  dx{[yt(x,tb)y(x,ti,)  -  yt(x,tb)y(x,tb)] 

xo 

-  [yt(xft0)y(x»t0)  -  yt(x,to)y(x,t0)J}  (17a) 

The  terms  in  D2  involve  the  boundary  conditions  from  the  second  partials  of  y 
and  y  as 

tb  -  Xb  -  xb 

D2  *  /  (-a2){yxy|  -  yxy|  }dt 

t0  Xq  Xq 


tb  - 

D2  ~  f  dt(-a2[yx(xb,t)y(xb,t)  -  yx(xb,t )y(xb,t) 
to 

+  a2[yx(x0,t)y(x0,t)  -  yx(x0 ,t)y(x0 ,t ) J } 


(17b) 


The  terms  in  D3  involve  the  boundary  conditions  from  the  fourth  partials  of  y 
and  y  as 

tb  -  Xb  xb  -  Xb  -  Xb 

°3  -  I  ^”b2yxxyx|  +  b2yxxyx|  +  (b2yxx)xy |  +  (-b2yxx)xy|  }dt 

to  Xo  x0  Xo  x0 


D3  =■  /  dfc{-b2tyxx.(xb»t)yx(xb,t)  -  yXx(xb.t)yx(xb,t)] 

to 

+  bZ[yxx(xo»t)yx(xo»t)  -  yxx(xo.t)yx(x0.t)]> 
tb  - 

+J  dt{-b2[-yxxx(xb>t)y(xb»t)  +  yXxx(xb.t)y(xb,t) ] 
co 

+  b2[-yxxx(x0,t)y(x0>t)  +  yx(xQ ,t)y(x0 , t) ] } 

In  order  that  D  =  0  in  Eq.  (16b)  it  is  sufficient  that 

Di  =  0 
D2  -  0 

and  D3  =  0 


(17c) 


(18a) 

(18b) 

(18c) 


IV.  THE  SYMMETRICAL  ADJOINT  SYSTEM.  The  adjoint  independent  variable  t 
in  Figure  1  can  be  expressed  as 


xb  -  t  t  -  t0 

m  ■  1  1  1  ii  3  .  '«■  1 — 

Tb  -  To  tb  -  t0 


(19) 
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which  gives 


T  *  Tb  for  t  -  t0 

(20a) 

and 

T  -  t0  for  t  -  tb 

(20b) 

It  is  noted  from  Eq 

.  (19)  that 

"  To  =  *b  “  tQ 

(21a) 

x  *  Tb  +  to  “  t 

(21b) 

d t  -  -dt 

(21c) 

• 

d  d 

dr  dt 

(21d) 

and 

y(x,t)  =  y(x,t  »  Tb+t0-t) 

(21e) 

Let  us  assume  that 

the  adjoint  system  shown  in  Figure  1  gives 

y(x,t-t)  =  y(x,t=»tb+t0-t) 

A  A 

(22a) 

yt(x,t=t)  *  -yt(x,t^tb+t0-t) 

A  A 

(22b) 

yx(x,t-t)  =  yx(x,t=tb+tQ~t) 

(22c) 

where  t  is  a  dummy  variable  for  t. 

We  may  define  the  adjoint  system  as  the  image  reflection  in  the  time 
domain  of  the  original  system.  Equation  (22)  yields  the  following  known 
initial  conditions  * 

A  A 

y(x,t»tb)  ■  y(x,t-t0)  (known)  (23a) 

A  A 

yt(x,t=tb)  -  -yt(x»t”t0)  (known)  (23b) 

The  interpretation  of  the  above  equations  gives  the  initial  conditions  of  the 
original  system  as  the  far  end  conditions  for  the  adjoint  system,  since  the 
adjoint  system  is  a  reflected  mirror  of  the  original  system  in  time. 

V.  INITIAL  CONDITIONS  FOR  THE  ADJOINT  SYSTEM.  We  take  a  symmetry 
approach  for  the  initial  conditions  of  the  adjoint  system  as 


y(x,t=tb) 

=  y(x,t=t0) 

,  yt(x.t=tb) 

=  -yt(x,t*t0) 

(24) 

y(x,t=t0) 

=  y(x,t=-tb) 

,  yt(x»t=tQ) 

"  -yt(x»t=*tb) 

(25) 

Thus  Eq.  (17a)  becomes 
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Dl  ■  J  dx{[yt(x,t=-tb)y(x,t=t0)  +  yt(x,t«t0)y(x,t-tb) 
x0 

-  [yt(x,t=t0)y(x>t*tb)  +  yt(x,t=tb)y(x,t=t0)l}  =  0  (26) 

Since  the  integrand  of  Eq.  (26)  is  zero,  the  above  satisfies  Eq.  (18a).  The 
initial  conditions  in  Eq.  (25)  are  given.  Therefore 

6y(x,t=tb)  =  6y(x,t»t0)  “  0  (27a) 

5yt(x,t»tb)  “  -6yt(x,t=t0)  =  0  (27b) 

VI.  THE  GENERALIZED  BOUNDARY  CONDITIONS.  Let  us  consider  the  operator  L 
in  Eq.  (15a)  for  two  different  cases  as  follows. 

A.  For  the  wave  equation  we  have 

Ly  =  ytt  "  *2yxx  U8) 

Let  us  assume  that  elastic  springs  are  installed  at  the  ends  such  that 

YxCxb^)  *  kby(xb,t)  ,  yx(xb,t)  -  kby(xb,t)  (29a) 

yx(x0,t)  =  -k0y(x0,t)  »  yx(xo»t)  =  -koy(x0ft)  (29b) 

This  is  equivalent  to  state  that  the  fixed  end  condition  for  a  prismatic  bar 
is  k-b  *  k0  *  ®  and  the  free  end  condition  is  kb  ■  *  0.  If  Eq,  (29)  is 

substituted  into  Eq.  (17b)  we  have 

D2  *  0  (30) 

B.  For  the  beam  equation  we  have 

Ly  =  ytt  +  b2yxxxx  (31) 

Two  sets  of  springs  are  incorporated  at  the  ends.  They  are: 

(1)  Torsional  springs  relates  the  moments  (the  second  partials)  with  the 
slopes  (the  first  partials) 

yxx(xb*t)  =  nbyx(xb,t)  yxx(xb>t)  =  nbyx(xb,t)  (32a) 

yXx(x0,t)  =■  -noyx(x0,t)  yxx(x0»t)  “  -noyX(x0>t:)  (32b) 

(2)  Linear  springs  relates  the  shears  (the  third  partials)  with  the 
deflection  (no  partials) 

yxxx^xb»t)  *  Cby(xb,t)  yXxx(xb>t)  =  Cby(xb,t)  (33a) 

yxxx^xo » t )  =  -C0y(x0,t)  yXxx(x0,t)  =  -C0y(x0»t)  (33b) 
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By  substituting  Eqs.  (32)  and  (33)  into  Eq*  (17c)  we  have 

D3  3  0  (34) 

Table  I  shows  the  assignment  of  the  spring  constants  for  various  physical  end 
conditions. 


TABLE  I.  GENERALIZED  BOUNDARY  CONDITIONS 


At  Fixed  End 

At  Hinged  End 

At  Guided  End 

At  Free  End 

y=y=o 

y»y»0 

yx=yx=° 

yXx=yxx=0 

yx=yx=o 

yxx=yxx=o 

yxxx3yxxx=° 

yxxx“yxxx=0 

6y=6y=0 

5y=6y-0 

5yx=5yx=0 

5yxx”«yxx=0 

'5yx=,syx-0 

5yxx=5yxx=0 

,syxxx=<syxxx=i0 

^yxxx=  6yxxx=° 

Torsional 

Spring 

A 

yxx=fiyx 

n  « 

n  ■*  0 

n  +  00 

r»  0 

Deflection  Spring 

Yxxx=cy  c  ” 

c  +  • 

c  ♦  0 

c  0 

Spring 

yx=ky 

6y-6y-0 

5yx=6yx=0 

k  ■*  » 

k  ->■  0 

k  =  undetermined 

VII,  THE  FIRST  VARIATION.  The  sum  of  the  two  functionals  is  obtained  by 
adding  Eqs.  (6c)  and  (14)  as 


xb  tb  -  - 

J  +  I  -  /  /  (QybyQ)dxdt  +  T  +  W  +  B  (35) 

xo  t0 

where 

1  *b  tb  1  tb  -  -  xb 

T  *  r  J  Cyty+yty)|  dx  »  w  *  “  J  C-a2)(yxy+yxy) I  dt 

2  xQ  to  2  t0  x0 


and 

1  tb  -  -  xb  1  tb  -  -  xb 

B  -  -  /  (“b2)(yxxyx+yxxyx) |  dt  -  -  /  K-b2yxx)xy  +  (-b2yxx)xy|  dt  (36) 

2  t0  Xq  2  t0  xQ 
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By  taking  the  variations  6y  and  6y  separately,  we  let 

6J  =  6j(Sy)  +  6J(5y)  (37) 

Then  one  obtains  from  Eqs.  (35)  and  (36)  that 

Sj(6y)  =  -<51(<$y)  +  //  Q<5y  dxdt  +  6l(6y)  +  <SW(6y)  +  6B(<Sy)  »  0 

where 

-  1  ,xb  —  —  tb  -  1  ■  —  —  Xb 

6T(5y)  =  -  /  (ytsy+y5yt)|  dx  .  W&y)  =  -  J  (~a2)(yx6y+ySyx)  I  dt 
2  *o  to  2  t0  x0 

and 

6B(6y)  -  -  /  (-b2)(yxx6yx+yx6yxx)|  bdt 
2  to  xo 

-  ”  /  [(-b2)yxxx5y  +  (-b2)y5yxxx]  dt  (38) 

2  to  Xo 

where  — 15 X( <5y)  can  be  derived  from  Eqs,  (11)  and  (12)  as 


-  Xfc  tfc 

-6i(6y)  =  -/  /  (yt<syt'w2y(5y“a2yx'syx-,b2yxx<5yxx)dxdt  (39) 

xo  to 


The  second  terra  on  the  right  side  of  Eq.  (37)  is 

5j(5y)  *»  -<$I($y)  -I-  //  Q6y  dxdt  +  6T(<5y)  +  6W(6y)  +  <5B(<$y)  -  0 

where 

1  xb  —  —  tb  1  t^  —  -  Xb 

6T(6y)  =  -  J  (yt5y+y6yt)  |  dx  ,  SW(Sy)  -  -  /  (-a2) (yx<5y+y6yx)  |  dt 

2  tQ  2  t0  xQ 


6B(5y)  -  -  J  ('b2)(yxx6yx+yx6yxx)|  dt 
2  tQ  x0 


[  (~b2yxxx) 6y-b2y6yxxx]  dt 

Xo 


and 


xb  tb  -  - 

-6i(6y)  ~  -  /  /  (yt<syt_(i)2y6y“a2yx<syx(”b2)yxx6yxx)dxdt 

xo  tQ 

It  is  noted  that  Eqs.  (38)  and  (40)  are  exactly  the  same  form,  where  Eqs. 
and  (41)  are  also  similar. 


(40) 


(41) 

(39) 
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For  the  beam  equation  it  is  noted  that  the  high  partials  in  Eqs.  (38)  and 
(39)  can  be  replaced  by  Eqs,  (32)  and  (33).  The  variations  of  the  adjoint 
higher  partials  from  these  equations  can  be  written  as 

5yXx<Xb,t)  »  nbSyx(xb,t)  <5yxxx(xb,t)  -  cb6y(xb,t)  (42a) 

6yxx(xo»£)  “  "no5yX(xott)  6yXxx(xo»t)  *  “C06y(x0,t)  (42b) 

Equations  (38)  and  (39),  with  the  aid  of  Eqs.  (32),  (33),  and  (42),  are 
the  key  equations  to  be  used  for  the  finite  element  method.  It  is  noted  that 
the  first  variation  6J(<$y)  is  the  same  as  the  first  variation  <5J(<5y)  by  adding 
or  dropping  the  bar  on  top  of  the  variables  and  their  variations.  We  do  not 
need  to  solve  for  the  adjoint  system  in  Eqs.  (40)  and  (41)  since  they  give 
exactly  the  same  solutions  as  that  of  the  original  system. 


VIII.  SECOND  VARIATIONS.  The  functions  y  and  y  and  their  partials  are 
written  in  the  form  in  terms  of  a  small  parameter  y 


y(x,t,y)  =  y(x,t)  +  6y(x,t,y) 

,  <$y(x,t,u)  =  yn(x,t) 

(43a) 

yt(x,t,y)  «  yt(x,t)  +  5yt(x,t,y) 

,  6yt(x,t,u)  =  ynt(x,t) 

(43b) 

yx(x,t,y)  =*  yx(x,t)  +  <5yx(x,t,y) 

,  5yx(x,t,u)  =  ynx(x,t) 

(43c) 

y(x,t,y)  =  y(x,t)  +  Sy(x,t,y) 

,  6y(x,t,u)  =  un(x,t) 

(43d) 

yt(x,t,y)  =■  yt(x,t)  +  6yt(x,t,y) 

»  <$yt(x,t,u)  18  ynt(x»t) 

(43e) 

yx(x,t,y)  -  yx(x»t)  +  Syx(x,t ,y) 

,  5yx(x,t,u)  =  yr^Cx^) 

(43f) 

Similar  expressions  can  be  derived  for  higher  partials  in  x.  Thus,  the 
functional  J(|i)  can  be  expressed  [2]  as 


J(u)  =  J(u=0)  +  6J  +  S2J 


(44a) 


where 


aj 

6J  =  y(— ) 

ua0 


(44b) 


and 


<$2J 


y2  32j 

2  3y2^u=0 


(44c) 


By  taking  variations  of  6J(<Sy)  in  Eqs.  (38)  and  (39)  and  some  for  6J(5y)  in 
Eqs.  (40)  and  (41),  we  have 

62 J  =  62T  +  <52B  +  62W  -  62I  (45a) 


where 


52T 


1  x0  -  -  tb 

-  /  (<$yt<5y+6ySyt)  I  dx 

2  xb  t0 


(45b) 
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fi2e  *  r  J  b  (-b2)(6yxx6yx+6yx«yxx)|  b<*t 
2 


and 


+  ;  /  bb2(6yxxxfiy+5y6yxxx)l  b<it 

2  t0  xQ 

1  tjj  Xfc 

<$2W  =  -  f  (-a2)(6yx<5y+6y6yx) |  dt 

2  tQ  x0 


The  second  variation  of  I  is  obtained  from  Eqs.  (39)  and  (41)  as 


,  V2  82I 

s  1  “  r  cr2} 

2  3(iz  U.Q 


(45c) 

(45d) 


1  1  - 

-  -  Sy[SI(6y)]  +  -  6y[6I(«y)] 

2  2 

1  xb  tb 

=  r  /  /  (<5yt,Syt"w26y(5y“a25yx6yx-b26yxx6yxx)dxdt 

2  Xq  tQ 

Mb  cb  -  - 

+  -  /  /  (<5yt,5yt-w2,sy,5y-a26yx6yx-b26yxx6yxx)dxdt 

2  Xq  t0 

xb  tb  - 

62i  =  /  f  (6yt<syt_a)2<sy<sy_a2,syx5yx“b2<syxx,syxx)ti5Ctit  (45e) 

xo  t0 

Substituting  Eq,  (27)  into  Eq.  (45b)  we  have 

S2t  =  0  (46a) 

For  all  the  end  conditions  in  Table  I  either  the  variations  <5yxx  and  5yxx  must 
vanish  or  <5yx  and  <5yx  must  vanish.  Thus,  the  first  term  on  the  right  side  of 
Eq.  (45c)  is  zero.  Similarly,  for  all  the  end  conditions  in  Table  X  either 
the  variations  <5yxx  and  Syxx  must  vanish  or  <5y  and  <5y  must  vanish.  Thus  the 
second  term  on  the  right  side  of  Eq.  (45c)  is  also  zero.  The  third  term  is 
zero  except  at  the  guided  end.  Thus,  in  general 

62B  =  0  (46b) 

In  Table  I,  except  the  free  end,  either  the  <$yx  and  5yx  must  vanish  or  6y 
and  6y  must  vanish.  Thus,  one  obtains 

62W  =  0  (46c) 
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(47) 


This  reduces  the  second  variations  <S2J  to 

52  J  =  -62I 

as  given  in  Eq.  (45e). 

Substituting  Eq.  (45e)  into  Eq.  (47)  gives 

S2J  -  /  h  /  ^  [-6yt(x,t)6yt(x,t)  +  u>25y(x,t)<Sy(x,t)  + 
to  xo 

+  a26yx(x,t)<5yx(x,t)  +  b26yxx(x,t) 6yxx(x,t)  Jdxdt  (48) 

In  order  that  the  functional  J  is  an  extremum  [3,4]  ,  the  second  variation  6  J 
must  be  either  positive  semi -definite  or  negative  semi-finite,  i.e. , 

S2J  >  0  (or  <$2J  <  0)  (49) 

The  above  is  a  necessary  condition  for  a  minimum  (or  a  maximum) • 

The  adjoint  variations  in  Eq.  (48)  may  be  obtained  by  the  relations  given 


in  Eq.  (22)  as 

A  A 

<Sy(x,t-t)  -  6y(x,t=tb+t0-t)  (50a) 

A  A 

<5yt(x,t=t)  -  -<$yt(x,t-tb+t0-t)  (50b) 

A  A 

6yx(x»t^t)  =  5yx(x,t=tb+t0-t)  (50c) 

The  variations  of  adjoint  initial  conditions  can  be  derived  from  Eq.  (23)  as 

^  A  A 

<5y(x,t=tb)  -  5y(x#t=tb)  ■  0  for  all  x  (51a) 

A 

<5yt(x»t=tb)  =  -<5yt(x,t*t0)  “  0  for  a11  x  (51b) 

By  substituting  Eq.  (51)  into  Eq.  (48),  we  have 


52J  =  /  /  P(x,t)dxdt 

to  xo 

where 

P(x,t)  ~  <5yt(x,t)6yt(x,tb+t0-t)  +  w26y(x,t)  6yx(x,tb+t0-t) 
+  a26yx(x,t)6yx(x,tb+tQ-t)  +  ba6yxx(x,t)<5yxx(x,tb+t0-t) 


(52a) 


(52b) 
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*  IX«  SENSITIVITY  RELATIONSHIP .  In  order  to  show  that  the  second 
variation,  of  the  functional  J  is  positive  semi-definite,  one  needs  to  obtain 
the  variations  of  the  function  and  its  partials  together  with  that  of  the 
adjoint  functions  and  i'ts  partials  as  indicated  in  Eq,  (48).  We  can  get  these 
variations  through  the  study  of  the  sensitivity  coefficients  [5]  and  its 
relationship  to  the  parameters  given  in  Eq,  (43),  Let  the  forcing  function  in 
Eq.  (1)  be 


Q(x,t)  *=  qf (x,t)  (53) 

It  is  assumed  that  the  forcing  function  parameter  q  is  subject  to  a  small 
constant  perturbation  <5q  as 


q  =  q0  +  6q 


Then  the  variation  of  the  function  y  is 

c  ,  9y(x,t) 

<$y(x,t)  =  - 

aq 

where 

v(x,t)  * 


<Sq  -  v(x,t)iq 

Oy 

3q 


(55a) 


(55b) 


The  quantity  v  is  the  sensitivity  coefficient  for  the  variation  Sy(x,t)  due  to 
a  small  constant  perturbation  6q. 

The  original  p.d.e,  in  Eq.  (15a)  can  be  written  as 
<j>  *  Ly  +  Q 

=  ytt  +  “2y  “  a2yxx  +  b2Yxxxx  +  qf (x,c)  -  0  (56) 

Due  to  the  perturbation  of  q  the  change  of  <J>  obeys  the  following,  relationship 


3ytt  3<1 


8yxx  aq 

3Xxxxx 

(56)  that 

3  <J> 

-  -  i 

3  4* 

3ytt 

3y 

J±.  =  -a2 

3<j> 

3yxx 

3yxxxx 

(57) 


UT 


(58a) 


(58b) 
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Using  Che  definition  in  Eq.  (55b)  the  partials  can  be  interchanged  as 


and 


Sytt  =  3^_  3y 

3q  3t2  3q 


V 


tt 


3yxx  32  3y 

3x2  (3q)  =  Vxx 


8yxxxx  3 4  3y^ 

3q  3x4  3q 


vxxxx 


(59a) 

(59b) 

(59c) 


Substituting  Eqs,  (58)  and  (59)  into  Eq.  (57)  we  have 

vtt  +  oj2v  -  a2VxX  +  b2vxxxx  +  f(x,t)  ■  0  (60) 

If  we  compare  the  definitions  of  variation  in  Eq.  (43a)  with  the  definition  of 
sensitivity  relationship  in  Eq.  (55a)  we  have 

<5y(x,t)  =  yn(x,t)  -  (<Sq)v(x,t)  (61) 


which  gives 


n(x,t)  -  v(x,t) 


(62a) 


and 


<Sq  ■  P 


(62b) 


Thus  Eq.  (60)  becomes 

ntt  +  w2n  -  a2HxX  +  h2nxxxx  +  f(x»t)  =  0  (63) 

which  gives  the  p.d.e*  of  the  variations  of  the  original  system. 

If  we  compare  Eq.  (63)  with  Eq.  (56)  we  see  that  the  variation  n(x,t)  = 
p"16y(x,t)  in  Eq.  (63)  takes  the  place  of  the  function  y  in  Eq.  (56)  with  q  * 
1.  Therefore,  the  p.d.e.  for  the  variations  is  unchanged  except  by  a  scale 
factor.  Thus  the  solution  of  the  variation  6y(x,t)  has  the  same  form  as  that 
of  the  original  function  y. 

Similarly  for  the  adjoint  system  one  can  obtain 

5y(x,t)  -  pn(x,t)  -  (<Sq)v(x,t)  (64) 

n(x,t)  =*  v(x,t)  (65a) 

<5q  53  y  (65b) 
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and 


ritt  +  ^  "  a2rhra  +  b2rlXxxx  +  -  0  (66) 

which  is  the  p.d.e,  of  the  variations  of  the  adjoint  system. 

X,  EXTREMAL  OF  FUNCTIONAL  FOR  A  SIMPLE  OSCILLATOR.  To  show  that  S2J 
must  be  positive  semi -definite  we  start  with  an  example  by  a  simple  harmonic 
oscillator  with  no  forcing  function.  Thus  from  Eq.  (63)  we  have  the  ordinary 
differential  equation  [6] 

ntt  +  w2ti  =  0  (67) 

The  solution  for  the  above  equation  is 

5y  =  tin  =  A  cos(wt+0)  (68a) 

6yt  =  =  -oiA  sin(oit+6)  (68b) 

Both  A  and  0  can  be  determined  from  the  following  given  initial  conditions 

<$y(t=0)  =  <Sy(0)  =  A  cos  9  (69a) 

6yt(t=0)  =  <5yt(0)  =  -uiA  sin  0  (69b) 

For  computation  by  the  finite  element  method  the  increment  time  is  taken 
as  T  which  gives 

T  -  tb  -  tQ  -  (-)W  (70) 

oi  2 


where  n  =  1,2,3. . . 

The  image  function  becomes 

6y(t=T-t)  =  A  cos  [  9-Ho(T-t)  ]  (71a) 

A 

6yt(t=T"c)  =  ““A  sin[ 9+w(T-t) ]  (71b) 

For  the  ordinary  differential  equation  we  have  the  second  variation  from  Eq. 
(52)  which  gives 

^  *  * 

62J  =  /  [ 6yt(x , t=t) 6yt(x ,t=T-t) 

o 

A  A 

+  (D25y(x,t=t)5y(x,t=T-t)]dt  (72) 

Separating  Eq,  (72)  into  two  parts  and  using  Eqs.  (68)  and  (71)  we  have 

S2J  =  62J[6yt]  +  <S2J[  oiSy  ]  (73a) 
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where 


(JJl 

S2J[Syt]  =  j  <o2A2  s  in( 9+wt ) s in( 9+wT- wt )  d  ( tot ) 
o 

iiiT 

(S2J[wfiyl  =  /  oi2A2  cos(  0+cot)cos( 0+wT-wt)d( wt) 


and 


uT  =  n(-) 
2 


which  is  a  multiple  of  tt/2. 

The  trigonometric  relationship  for  Eq.  (73)  is 

sin(tot+3)  =  sin  tot  cos  0  +  cos  tot  sin  0 
cos(wt+0)  *  cos  tot  cos  0  -sin  tot  sin  0 
sin( 9+wT-wt)  -  -sin  [tot  -  (0+niT/2)j 
=  -sin  wt  cos(9-Hitt/2)  +  cos  tot  sin(  9+mr/2) 
and  cos  ( 0+wT-tot)  =  cos  [tot  “  ( S-Hri'n/ 2)  ] 

=  cos  tot  cos(9+nir/2)  +  sin  tot  sin(0+mr/2) 
For  the  case  when  n  is  odd,  we  have 

n+1 


(73b) 

(73c) 

(73d) 


(74a) 

(74b) 

(74c) 

<74d) 


cos(0+nir/2) 

=  (-1)  2 

sin  9 

(75a) 

n-1 

sin(  Q+nir/2) 

“  (-1)  2 

cos  9 

(75b) 

For  the  case  when  n  is  even,  we  have 

cos(  0+nir/2) 

=  (-l)n/2 

cos  9 

(76a) 

sin(  Q+nir/2) 

-  (-l)n/2 

sin  0 

(76b) 

First,  we  take  the  case  when  n  is  odd.  Substituting  Eqs#  (74)  and  (75)  into 
Eq.  (73),  one  obtains 

52J[5ytl  -  to2A2/  {(sin  tot  cos  0  +  cos  <ot  sin  0) 
o 

•  [-  sin  tut( —1 ) Cn+1 ) / 2  Sin  q  +  cos  oit(-l) (n"l )/2  cos  0]}d((ot) 

■  (-l)(n-l)/2  w2a2/  ^  [sin  9  cos  6  +  sin  wt  cos  wt]d(tot) 

o 


(“l)^n"’^/^  w2A2[-  4-  —  sin  0  cos  0] 
2  2 


(77a) 
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and 


52J[oi6y]  =  (d2A2/  {(cos  u>t  cos  6  -  sin  wt  sin  0) 
o 

•  [cos  o)t(-l)(n+1)/2  sin  0  +  sin  tut  (-l)(n-l)/2  cos  0]}d(oot) 

■  (-l)(n-l)/2  oj2a2/  [-sin  0  cos  6  +  sin  tut  cos  tnt]d(ojt) 

o 

-  (  — 1  )  C  tl— X )/ 2  C02A2  [- - )  gin  9  COg  9]  (77b) 

2  2 

From  Eq.  (73a)  when  n  is  odd  we  have 

52J  =  (-i)(a-l)/2  u2a2{[^  +  —  sin  0  cos  6]  +  [—  -  —  sin  6  cos  0)} 

2  2  2  2 

<S2 J  -  (-i)(n-l)/2  w2a2  (77c) 

In  particular  for  n  “  1,  one  obtains 

62J  =  (d2A2  >  0  (78a) 

which  gives  a  minimum  for  the  functional  J.  For  n  -  3 

<52J  -  -oi2A2  <  0  (78b) 

which  gives  a  maximum  for  the  functional  J,  It  is  noted  that  S2J  is 

independent  of  0  which  is  related  to  the  starting  conditions.  It  is  also 

independent  of  the  polarity  of  A  since  it  appears  in  terms  of  A2. 

Now  we  take  the  case  when  n  is  even.  Substituting  Eqs.  (74)  and  (76) 

into  Eq.  (73),  one  obtains 

nir/2 

62J[5y^]  -  ui<$A2/  {(sin  ait  cos  0  +  cos  (ot  sin  9) 
o 

[-  sin  ait  (-l)n/2  cos  0  +  cos  (nt(-I)n/2  sin  0]}d(wt) 
n7r/2 

=  (-l)n/2  (h2A2  /  [-  sin2  ait  cos2  9  +  cos  2  mt  sin2  0]d((ot) 

o 

=  (-l)n/2  di2A2(-  cos2  9  +  sin2  0)nir/4  (79a) 
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and 


S2J[u)Sy]  =»  u)2A2/  ^  {(cos  wt  cos  6  -  sin  wt  sin  9) 

o 

•  [cos  <i)t(-l)n/2  cos  9  +  sin  a)t(-l)n/2  sin  9]}d(u>t) 

(-1)0/2  U2A2  f1"' ‘  [cos2  wt  cos2  9  -  sin2  rnt  sin2  9]d(uit) 
o 

-  (-l)n/2  w2A2(cos2  9  -  sin2  0)mr/4 


(79b) 


From  Eq.  (73a)  when  n  is  even  we  have 

52J  =  (-l)n/2  w2A2{(-  cos2  9  +  sin2  9)  +  (cos2  9  -  sin2  9)}n7r/4 

62J  «  0  for  all  n  51  even  (79c) 

We  can  conclude  here  that  the  functional  J  definitely  [6]  has  a  minimum 
if  ojT  =»  -ir/2,  or  T  is  a  quarter  of  the  natural  period  of  the  oscillation 
x  s  2tt/ui.  Moreover,  from  Eq.  (70)  for  n  =  1 

T  *  tfo  -  tQ  55  r/( 2w)  =  x/4  (80a) 

If  n  a  2  and  <S2J  -  0  in  Eq.  (79c),  we  have 

T  -  tb  -  t0  <  w/u  -  t/2  (80b) 

This  is  the  upper  limit  of  the  increment  we  chose  for  T,  above  which  the 
minimum  of  the  functional  J  is  not  guaranteed. 

XI.  EXTREMAL  FOR  A  SIMPLY  SUPPORTED  BEAM  WITH  CONCENTRATED  LOAD  AT  THE 
MIDDLE!  To  show  that  62J  must  be  positive  semi -definite  we  use  the  example of 
a  simplysupported  beam  with  a  concentrated  load  at  the  middle.  If  the  load 
is  suddenly  removed  [7],  Eq.  (63)  becomes 


ntt  +  ^^Dxxxx  3  0 

(81a) 

Or  from  Eq.  (56)  we  have 

ytt  +  b2yxxxx  -  0 

(81b) 

The  starting  conditions 

are 

—  Q0Cx)  =  0 
dt 

(82a) 

u0(x) 

(Px/2)h 

-  Me/ 1  -  -  for  0  <  x  <  i/2 

i 

(82b) 

and 

.  ®o(x) 

PU/2)h  x 

=  -  (1  -  -)  for  i/2  <  x  <  i 

I  J l 

(82c) 
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The  solution  for  Eq.  (81b)  is 


where 


a(x,t)  -  -  Eh 


Sas  «  i 

[  -r-  (— 1 ) (n“l )/ 2  gin(mrx/ A)cos 

^  n*odd  n 


(83a) 


pn  =  bn2ir2/i2 


(83b) 


as  =  (P£/2I)(h/2) 


(33c) 


The  quantity  ag  is  the  initial  static  stress  at  the  middle  of  the 
x  =  i/2  and  on  the  outer  surface  of  the  beam. 


beam  where 


In  order  to  find  yt  we  let 


1  8as  ^  1  £  “ 

y  =  —  ~rr  l  (“t)( — )2(-l)  2  sin(mrx/ Jl)cos  pnt 
Eh  tt2  n=odd  n2  rnr 


(84a) 


Then  by  partial  differentiation  we  have 


32y  1  8as  "  1 

— --  =  -  —  — —  l  (--)(-l)  2  sin(nirx'/i)cos  pnt 

Sx2  Eh  irz  n=odd  n2 


(84b) 


which  agrees  with  Eq.  (83a),  and 


3y  1  8os  ”1  " 

—  - I  (— : r)(-l)  2  b  sin(mrx/£)  sin  pnt 

8t  Eh  ir*  n=odd  n^ 


(84c) 


where  from  Eq.  (83b) 


b  -  pnZ2/(n2iT2) 


(84d) 


PI  -  bir2/&* 


(84e) 


It  is  noted  that  the  index  n  appears  in  both  spatial  and  temporal 
Eqs.  (84a)  and  (84b)  under  the  summation  sign.  We  are  interested 
those  functions  of  t  that  are  independent  of  the  Index  n.  Let  us 


functions  in 
in  finding 
assume  that 


U/2-c) 


=  n27r/ 2  -  n2irc /  i 
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It  is  noted  that  for  n  -  1,  3,  and  5,  n2ir/2  becomes  tt/2,  4ir  +  tt/2,  and  1  2tt  + 
tr/2,  respectively. 


Thus  we  have 

cos  pnt  a  cos  [n2ir/2  -  n2irc/ 1] 

=  cos  [it/2  -  n2itc/JJ.] 

-  sin  (n2ir/c/i)  (86a) 

and 

sin  pnt  =  cos  (n2ffc/ &)  (86b) 

Moreover,  for  c/(£/2)  =  X,  1/2,  and  0 

cos  pnt  -  sin[( tt/2)c/ ( £/2) ]  =  1,  0.707,  and  0,  respectively  (87a) 

and 

sin  pnt  -  cos[(7r/2)c/(i/2)]  ■  0,  0*707,  and  1,  respectively  (87b) 

The  above  functions  are  independent  of  index  n  at  those  values  of  c/(£/2). 
Thus,  Eq.  (84)  may  be  rewritten  at  those  values  as 

32y  A  °s 

— r  (x,t=*t)  - sin( ttc/  J2-)y0(x)  (88a) 

3x2  Eh 

and 

3y  *  °s 

—  (x,t=>t)  * - cos(irc/i)y0(x)  (88b) 

9t  Eh 


where  n-l 

yQ(x)  ■  l  (-1)  2  sinCmrx/ Z)  (88c) 

naodd 

The  series  terms  in  Eq,  (88c)  are  the  result  of  an  expansion  of  a  triangular 
deflection  of  the  form 

yQ(x)  -  x/U/2)  for  0  <  x  <  1/2  (88d) 

yQ(x)  =  2  -  x/U/2)  for  1/2  <  x  <  l  (88e) 

as  shown  in  Figure  2. 

For  the  images  of  Eqs.  (84b)  and  (84c)  the  time  dependent  terms  become 

cos  pnt(T— t)  =  cos(pnT-pnt)  (89a) 

and 

sin  pn(T-t)  =  sin(pnT-pnt)  (89b) 
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The  term  pnT  can  be  obtained  from  Eqs.  (84d)  and  (84e)  as 

pnT  -  (bn2x2M2)T  =  n2piT 

For  computation  by  the  finite  element  method  the  increment  in  time 
taken  as  T  which  is  defined  as 


where 


T  »  tb  -  t0  »  (m/pi) ( tt/ 2) 
m  *  1,2,3*.. 

Then  with  the  aid  of  Eqs.  (85),  (89c),  and  (90a)  we  have 

pn(T-t)  -  mn2(ir/2)  -  [n2Tr/2  -  n2irc/£] 

-  (m-l)n2(Tr/2)  4-  n2c/& 

Then  for  the  case  when  m  =  1,  the  time  dependent  terms  become 

cos  prx(T-t)  =  cos(n2irc/£) 

and 

sin  Pn(T-t)  =  sin(n2irc/i) 

By  similar  method  we  can  obtain 

32y  A  <*s 

— -  (x,t=T-t)  -  -  “  cos ( ftc/  £)y0(x) 


(89c) 

is 

(90a) 

(90b) 


9x 


Eh 


3y  A  has 

r~  (x,t=T-t)  - - sin(7Tc/i)y0(x) 

°  t  Eh 


(90c) 

(91a) 

(91b) 

(92a) 

(92b) 


For  the  partial  differential  equation  we  have  the  second  variation  from  Eq. 
(52)  which  gives 

«  a 

62J  *  /  [<$yt(x,t»t)6yt(x,t*T-t) 

o 

A  A 

+  b2Syxx(x,t-t)  5yxx(x,t=»T-t)  ]dt  (93) 

Separating  Eq.  (93)  into  two  parts  and  using  Eqs.  (88)  and  (92),  we  have 

62J  =  <S2J[5ytl  +  62J[b6yxx]  (94) 

The  first  term  on  the  right  of  Eq.  (94)  is 

PIT-tt/2 

S2J[$yt]  ~  J  (bas/Eh)2y02(x)dx/  cos(  ire/ £)sin(  wc/ Ji)d(  pit) 

x0  o 


*b 

2  /  (b0g/Eh)2yo2(x)dx  >  0 

x0 


(95a) 
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where 


pit  -  tt/2  -  t rc/£  d(pit)  *  ^<Tr/A)dc 


(95b) 


Pit  =  tt/2  ,  c/(£/2)  =  0  ,  W&  =  0 


(95c) 


Pit  -  0  ,  c/(A/2)  =  1  ,  ttc/ A  *  ff/2 

The  second  term  of  Eq.  (94)  is 

52J[b5yxx]  =«  /  b  (bas/Eh)2y02(x)dx/1T/2  sin(TTc/JO  cos(xc/£)d( ire/*) 
xQ  o 

=  /  b  (bas/Eh)2y02(x)dx  >  0 


(95d) 


(95e) 


Thus  by  combining  Eqs.  (95a)  and  (95e)  ,  one  obtains 


52J  =5  j  2(bas/Eh)2y02(x)dx  >  0 


which  gives  a  minimum  for  the  functional  J* 

Now  we  take, the  case  when  ra  =  2,  Then  Eqs.  (90c)  and  (89)  become 

pn(T-t)  =  n2n/2  +  n2irc/  &  (97a) 

cos  pn(T-t)  =  cos(n2Tr/2-l-n2irc/JO  ,  n2  *  1,9,25,  etc. 

=  -sin(n2Trc/£)  (97b) 


sin  pn(T-t)  *  sin(n2Tr/2+n2TTc/£)  ,  n2  *  1,9, 25, etc. 


cos(n2xc/ £) 


Thus  the  image  function  becomes 


(97c) 


32y  *  as 

— -  (x jt^T-t)  - - (-sin  ttc/ Z)yQ(x) 

3x2  Eh 


(98a) 


3  y  A  °s 

—  (x ,t=T-t)  - - cos(ttc/ Jl)y0(x) 

3t  Eh 


(98b) 
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By  substituting  Eqs.  (88)  and  (98)  into  Eq.  (94)  we  have 


xb  tt/2 

SZJ  a  J  (bas/Eh)2y02(x)dxJ 


[cos2(xc/2)  -  sin2(irc/i.)  ld(nc/Jl) 


a  /  (bos/Eh)2y02(x)dx(-)[sin  it  -  sin  0)  =0  (99a) 

x0  2 

We  can  conclude  here  that  the  functional  J  definitely  [8]  has  a  minimum  if 
PIT  =  it / 2 ,  where  T  is  a  quarter  of  the  natural  period  of  the  oscillation 
x  =  2ir / p i .  Moreover,  from  Eq.  (90a)  for  m  =  1,  we  have 

T  =  tb  -  t0  =  tr/(2pi)  -  t/4  (99b) 

If  in  =  2  and  <S2J  =  0  in  Eq.  (99a),  we  can  conclude  that 

T  -  tb  -  t0  <  it / P1  =  T/2  (99c) 

This  is  the  upper  limit  of  the  increment  we  choose  for  T,  above  which  the 
minimum  of  the  functional  J  is  not  guaranteed. 

XII.  CONCLUSIONS.  The  functional  in  bilinear  form  is  symmetrical  about 
the  original  variables  and  the  adjoint  variables.  The  Euler  Lagrange 
equations  for  the  original  and  the  adjoint  systems  are  derived  using  the 
fundamental  lemma  of  the  calculus  of  variations.  By  integrating  the  bilinear 
expression  by  parts,  one  can  obtain  the  bilinear  concomitant  in  terras  of  * 
initial  and  boundary  terms.  The  adjoint  system  can  be  arranged  in  a  manner 
that  it  is  a  reflected  mirror  of  the  original  system  in  time.  Thus  the 
initial  conditions  for  the  bilinear  concomitant  become  zero. 

Generalized  boundary  conditions  using  many  types  of  "springs"  relating 
the  various  spatial  partial  derivatives  are  defined  to  satisfy  the  boundaries 
of  the  concomitant.  The  higher  partials  in  original  variables  and  variations 
in  the  adjoint  variables  can  be  kept  In  low  orders  by  these  "springs". 
Algorithms  are  developed  for  use  in  the  finite  element  method  by  taking  the 
first  variations  of  the  functional.  These  algorithms  are  simplified  because 
the  adjoint  system  gives  exactly  the  same  solutions  as  that  of  the  original 
system. 

Sensitivity  coefficient  is  found  to  be  similar  to  the  variation  of  the 
variable  and  obeys  the  same  partial  differential  equation.  The  solution  of 
the  original  p.d.e.  is  taken  as  the  solution  of  the  variations  for  two 
examples,  a  simple  oscillator  and  a  simply-supported  beam  with  load  at  the 
middle.  It  is  found  that  the  second  variation  of  the  functional  Is  positive 
serai-definite  if  the  increment  in  time  for  the  finite  element  method  Is  less 
than  half  the  natural  period  of  the  physic  systems  in  both  cases.  This  will 
guarantee  a  minimum  for  the  functional  and  thus  the  method  is  truly  workable 
If  employed  as  algorithms  for  the  finite  element  method. 
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ABSTRACT 

To  evaluate  performance  of  enclosure  structures  due  to  an  internal  blast, 
the  dynamic  response  of  a  continuous  hemispherical  configuration  has  been  com¬ 
pared  to  the  response  of  the  same  configuration,  but  with  a  large  access  opening. 
The  results  indicate  substantial  alteration  of  the  response  behavior  and  stress 
concentration  effects  in  the  vicinity  of  the  cutout  region. 

INTRODUCTION 

The  Ballistic  Research  Laboratory  is  currently  in  the  process  of  acquiring 
a  target  enclosure  to  facilitate  destructive  terminal  ballistic  testing  of  chem¬ 
ical  explosives  (CE) ,  armor,  and  kinetic  energy  (K£)  penetrators  by  safe  con¬ 
tainment  of  blast,  fragments,  and  resultant  harmful  combustion  products.  The 
present  investigation  is  an  extension  of  the  work  reported  in  Reference  1,  where 
a  simplified  continuous  configuration  was  considered.  A  concept  layout  of  the 
firing  range  is  shown  in  Figure  1.  A  detailed  description  of  the  layout  is  given 
in  Reference  2. 

A  salient  feature  of  the  structure  is  a  large  sliding  door  with  a  config¬ 
uration  to  match  the  curvature  of  the  hemispherical  wall.  The  primary  function 
of  the  door  is  to  allow  equipment  access  inside  the  enclosure.  The  door  is 
sealed  to  the  wall  with  a  pressurized  hose  seal  along  its  perimeter.  Addition¬ 
ally,  an  air  exhaust  system  mounted  at  the  rear  of  the  structure  operates  during 
the  test  and  draws  aerosolized  material  out  of  the  enclosure  after  a  test  and 
traps  it  in  filters  in  the  exhaust  ducting.  The  entire  structure  is  built  to 
contain  blast  and  fragments,  to  trap  aerosolized  materials,  and  to  permit  photo¬ 
graphic  observation  of  the  test. 

For  a  continuous  shell  configuration  clamped  to  a  horizontal  rigid  founda¬ 
tion,  the  three-dimensional  problem  was  reduced  to  a  two-dimensional  axisyrametric 
configuration  by  considering  a  single  pie-shaped  segment  of  the  hemispherical 
enclosure.  The  entire  structure  was  subsequently  generated  through  360-degree 
rotation  of  the  segment  about  the  vertical  axis  of  symmetry  resulting  in  quite 
economical  computer  calculations.  However,  the  analysis  suffered  from  a  short¬ 
coming  inasmuch  as  the  results  could  not  be  extrapolated  to  a  discontinuous  shell 
configuration  with  cutouts  and  access  opening  due  to  nonlinear  geometric  char¬ 
acteristics.  The  existence  of  high  stress  concentrations  at  the  comers  can  re¬ 
duce  the  margin  of  safety  and  result  in  permanent  deformation  in  these  regions. 
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The  structural  integrity  of  a  discontinuous  configuration  could  not  bo  assured 
with  certainty  from  the  study  of  a  continuous  structure  and  a  detailed  three- 
dimensional  analysis  with  inclusion  of  the  cutout  was  deemed  to  be  necessary. 

The  current  study  is  devoted  to  a  demonstration  of  feasibility  of  modeling 
large  scale  structures  with  cutouts  and  determination  of  the  influence  of  a 
relatively  large  cutout  on  the  response  behavior  of  a  containment  structure 
using  a  comparative  evaluation  of  a  discontinuous  enclosure  relative  to  a  con¬ 
tinuous  configuration. 

For  an  enclosure  structure  in  a  test  facility,  entrance  hole  opening  for  in¬ 
coming  projectiles  and  duct  openings  for  filtration  equipment  at  the  rear  of  the 
structure  are  necessary  in  addition  to  personnel  and  equipment  access  openings. 
These  openings  are  relatively  small  compared  to  the  large  rectangular  equipment 
access  door  opening.  Although  these  openings  have  been  included  to  allow  for 
venting  of  internal  pressure  to  the  ambient  atmosphere,  they  are  ignored  in  the 
simplified  finite-difference  model  under  the  assumption  that  relative  to  the 
large  door  opening,  they  have  rather  small  influence  upon  structural  response. 
Taking  advantage  of  this  approximation  and  the  resulting  lateral  symmetry,  only 
one  half  of  the  structure  and  the  door  opening  have  been  considered  and  multiple 
hole  interaction  effects  have  been  excluded. 

Previous  work3’5  on  modeling  of  structures  with  cutouts  involved  relatively 
simple  structures  with  symmetrically  located  cutouts  of  rather  simple  shapes. 

The  current  improvements  in  modeling  techniques  are  significant  insofar  as  large 
scale  three  dimensional  structures  with  relatively  large  cutouts  of  various 
shapes,  sizes,  and  locations  subjected  to  complex  loading  conditions  were 
analyzed  without  difficulty. 

ESTIMATION  OF  TRANSIENT  AND  QUASI-STEADY  LOADS 

A  major  problem  associated  with  the  enclosed  range  tests  is  the  overpressure 
that  results6  from  the  very  rapid  heating  of  air  within  the  enclosure  as  the 
penetrator  and  the  target  are  tom  apart  during  the  encounter.  The  structure 
and  the  seais  oust  bear  the  load  for  the  entire  life  cycle  of  the  range  without 
catastrophic  failure  of  critical  regions  during  loading  and  unloading  phases 
until  the  pressure  is  vented  and  the  products  are  cooled  through  mass  and  heat 
loss  mechanisms  to  ambient  conditions. 

Since  the  key  elements  of  the  firing  ranges  are  the  enclosure  structures, 
the  structural  analysis  group  of  the  Blast  Dynamics  Branch  at  the  Ballistic  Re¬ 
search  Laboratory  (BRL),  was  assiped  the  task  of  estimating  the  overpressure 
loading  on  the  wall  and  assure  structural  integrity  from  a  conservative  view¬ 
point.  The  choice  of  a  hemispherical  configuration  was  due  to  an  earlier  in¬ 
vestigation  by  N.  J.  Huffington,  et.al.7  who  studied  a  continuous  hemispherical 
configuration. 

Since  quasi- steady  and  dynamic  pressures  upon  the  wall  are  not  expected 
to  be  significantly  altered  due  to  the  closed  equipment  door  during  a  test,  the 
internal  loading  data  in  Reference  1  was  used  for  both  continuous  and  discon¬ 
tinuous  structures.  The  salient  features  of  the  2S.4mm  thick  hemispherical 
shell  design  and  the  contoured  equipment  access  door,  4.267m  wide  and  S.486ra 
high  are  described  in  References  2  and  8. 

DYNAMIC  RESPONSE  ANALYSIS 

Response  of  both  continuous  and  discontinuous  structures  subjected  to  in¬ 
ternal  pressurization  from  a  centrally  located  explosive  blast  was  simulated 
using  a  BRL  version  of  the  PETROS  3.S  computer  program3  which  employs  the 
finite-difference  method  to  solve  the  nonlinear  equations  governing  finite- 
amplitude  elasto-plastic  response  of  thin  Kirchhoff  shells.  The  model  is  valid 
for  large  deflections  and  can  be  employed  to  treat  the  entire  structure  rather 
than  a  small  section. 

Material  Model.  The  uniaxial  tensile  quasi-static  stress-strain  property 
of  1020  steel  described  in  Reference  7  was  used  for  primary  vessel  material  in 
this  analysis.  The  material  was  modeled  in  the  code  as  a  combination  of  three 
linear  segments  followed  by  a  perfectly-plastic  behavior  and  linear  elastic 
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unloading,  resulting  in  a  polygonal  approximation  of  the  experimental  data.  The 
strain  hardening  part  of  the  stress' strain  curve  is  generated  by  a  sublayer 
hardening  model®  from  a  weighted  combination  of  elastic  perfect ly-plastic  curves 
yielding  a  piecewise  multilinear  hardening  representation.  Strain-rate  effects 
were  neglected,  which  Is  conservative  since  these  effects  tend  to  increase  the 
structural  resistance  and  thus  reduce  the  total  deformation. 

Finite  Difference  Model.  To  add  capability  to  handle  discontinuous  geometry, 
the  PETROS  5.5  code  was  modified  substantially  at  BRL.  For  computational  pur¬ 
poses,  the  boundary  conditions  from  the  outer  edges  of  the  shell  configuration 
were  applied  selectively  to  the  inner  boundaries  of  the  cutout.  The  boundary 
conditions  at  the  edges  of  the  cutout  can  be  symmetric,  clamped,  hinged,  or  free 
or  any  combinations  of  these.  Capability  for  modeling  multiple  cutouts  is 
available  to  facilitate  study  of  interaction  effects  provided  a  minimum  number 
of  meshpoints  are  allowed  between  cutouts. 

To  simplify  the  model  of  the  enclosure  with  the  rectangular  cutout,  only  one 
half  of  the  cutout  and  the  structure  was  considered  due  to  lateral  symmetry. 

A  total  of  22  meshpoints  in  the  circumferential  and  20  meshpoints  in  the  hoop 
direction  were  used,  except  in  the  cutout  regions  wherein  only  12  meshpoints 
were  used.  Each  meshpoint  used  for  Gaussian  integration  points  through  a  single 
thickness  layer.  The  meshpoints  at  the  base  of  the  structure  and  along  the 
periphery  of  the  cutout  were  restrained  from  movement  in  both  axial  and  trans¬ 
verse  directions.  Clamped  edge  conditions  were  imposed  along  the  cutout  boun¬ 
dary  along  which  the  door  must  be  tightly  clamped  to  the  enclosure  wall  during 
a  test.  All  other  meshpoints  were  allowed  unrestrained  movement  in  any  direc¬ 
tion.  Only  the  nodes  were  subjected  to  blast  pressure  in  the  radially  outward 
direction. 

For  the  continuous  structure,  only  a  quarter  segment  was  modeled  using  18 
equal  width  meshes  along  the  surface  and  a  single  layer  through  the  thickness 
to  represent  the  pie  shaped  segment.  Four  Gaussian  integration  points  through 
the  thickness  were  used  at  each  meshpoint. 

RESULTS  AND  DISCUSSION 

The  deformed  model  of  the  discontinuous  hemispherical  shell  configuration 
relative  to  the  undeformed  configuration  shows  a  peak  deflection  of  l.28mm  in 
the  radially  outward  direction  at  approximately  17  ms  in  Figure  2a  which  is 
comparable  to  a  peak  displacement  of  1.17mm  observed  at  approximately  56  ms  for 
the  continuous  enclosure  shown  in  Figure  2b.  However,  in  addition  to  the 
radially  outward  displacement,  the  discontinuous  configuration  exhibits  a 
lateral  displacement  component  with  a  highly  reduced  peak  of  ,16mm  indicating 
a  tendency  of  the  pole  to  shift  laterally  by  a  small  amount.  This  behavior  is 
in  marked  contrast  to  the  single  degree  of  freedom  motion  in  the  radial 
direction  for  the  continuous  configuration1  where  the  pole  exhibits  no  lateral 
movement.  The  resultant  peak  deflection,  which  is  obtained  by  vectorial  suma- 
tion  of  individual  displacement  components,  is  found  to  be  1.29mm  predominantly 
in  the  radially  outward  direction  at  the  pole  for  the  discontinuous  enclosure 
and  is  nearly  9%  larger  than  corresponding  polar  deflection  for  the  continuous 
structure  with  identical  geometric  and  material  parameters.  However,  this 
displacement  is  less  than  4.5%  of  the  shell  thickness  for  both  structures  so 
that  geometric  nonlinearities  are  insignificant.  The  peak  deflection  is  of 
the  order  of  elastic  deflection  at  the  pole  and  the  residual  deformation  at 
the  pole  is  negligible  after  elastic  oscillations  are  damped  out  and  internal 
pressure  is  released. 

Energy  balance  studies  for  both  configurations  using  the  code  confirmed 
absence  of  plastic  work  and  numerical  instability.  Both  total  and  kinetic 
energies  were  bounded.  The  fluctuations  of  kinetic  energy  appeared  to  have 
twice  the  frequency  of  the  work  performed  by  the  internal  blast  pressure. 

Transient  circumferential  and  meridional  surface  strains  at  both  outer 
and  inner  walls  near  the  pole  for  the  continuous  hemispherical  enclosure1  are 
in  phase  and  approximately  equal  in  magnitude  indicating  domination  by  mem¬ 
brane  effects  due  to  elastic  vibration  of  the  wall  in  the  breathing  mode  as 
shown  in  Figure  3a  for  the  inner  surface.  Strain  components  at  the  clamped 
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DISPLACEMENT  AT  fOlE  (mm) 


1.2 


(a)  Displacement  components  at  the  pole  of  the  discontinuous  hemisphere 


Figure  2,  Transient  displacements  at  the  pole  of  the  continuous  and 
discontinuous  hemispheres ■ 


241 


(a)  Meridional  and  circumferential  strains  at  the  inner  surface  near 
the  pole  of  the  continuous  hemisphere 


(b)  Meridional  and  circumferential  strains  at  the  inner  surface  near 
the  pole  of  the  discontinuous  hemisphere 

Figure  3.  Comparison  of  surface  strains  near  the  pole  of  the  continuous  and 
discontinuous  hemispheres . 
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edge  at  the  base  are  in  general  out  of  phase  and  unequal  in  magnitude  due  to 
flexural  deformation.  Substantial  weakening  of  the  flexural  wave  was  observed 
near  the  pole.  However,  for  the  discontinuous  shell  configuration,  noticeable 
bending  effect  in  addition  to  stretching  was  observed  as  far  as  one  mesh  away 
from  the  pole.  Typical  strain  response  at  this  location  at  the  inner  surface 
is  shown  in  Figure  3b. 

Strains  were  computed  at  the  midpoint  of  the  top  edge  of  the  rectangular 
cutout  as  well  as  at  the  comer  point  to  assess  the  influence  of  the  cutout  on 
the  response  behavior  of  the  structure.  Meridional  strains  were  approximately 
three  times  as  high  as  the  circumferential  strains  at  both  outer  and  inner  sur¬ 
faces  at  the  midpoint  of  the  top  edge  of  the  cutout.  A  peak  meridional  strain 
of  .019%  at  approximately  10  ms  was  observed  at  the  inner  wall  in  contrast  to 
.015%  at  the  outer  wall.  A  similar  response  due  to  combined  stress  concentra¬ 
tion  and  bending  effect  from  the  clamped  edges  was  observed  at  the  comer. 
However,  meridional  strains  were  in  general  of  the  same  order  of  magnitude  as 
the  circumferential  strains  at  this  location  at  both  outer  and  inner  walls. 

At  a  point  one  mesh  away  from  the  pole,  circumferential  and  meridional 
strains  were  nearly  equal  and  in  phase.  A  peak  circumferential  strain  of  .020% 
was  observed  at  18  ms  at  this  location  at  the  outer  wall  in  contrast  to  a 
maximum  of  .018%  strain  at  the  inner  wall  indicating  propagation  of  bending  wave 
from  the  clamped  cutout  edge  to  the  pole.  Thus  both  peak  strains  and  deflection 
were  found  to  occur  simultaneously  at  the  pole  for  a  discontinuous  configuration. 
This  is  in  direct  contrast  to  the  behavior  of  the  doorless  enclosure  configura¬ 
tion  in  which  peak  strains  developed  at  the  clamped  edge  at  the  base  due  to 
combined  bending  and  stretching  while  peak  deflections  occurred  at  the  pole 
due  to  focusing  of  vibratory  energy.  The  development  to  stress  concentration 
and  bending  effects  of  the  clamped  cutout  edges.  However,  in  all  cases  strains 
were  small  enough  to  be  in  the  linear  elastic  regime  and  peak  stresses  calculated 
from  elastic  theory  did  not  exceed  40  MPa.  Margins  of  safety  at  critical  re¬ 
gions  based  on  a  yield  strength  of  241  MPa  for  mild  steel  were  equal  to  or 
greater  than  5.0  which  was  found  to  be  satisfactory. 
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ABSTRACT .  In  a  front  tracking  scheme  certain  waves  are  given  a 
more  exact  treatment  through  the  introduction  of  a  dynamically 
moving,  lower  dimensional  grid  that  Is  aligned  with  the  wave 
fronts.  These  tracked  waves  are  propagated  in  time  through  the 
solution  of  Rankine-Hugoniot  relations;  the  remaining  waves  are 
propagated  using  finite  difference  equations.  In  this  paper  we 
discuss  the  coupling  between  these  two  wave  systems. 

INTRODUCTION.  Front  tracking [3 ,6]  is  a  method  for  numerical  solution  of 
hydrodynaraical  problems  whose  basic  element  is  a  system  of  moving  curves 
(i.e.  the  front).  Each  curve  represents  a  physical  wave.  For  gas  dynam¬ 
ics,  these  waves  are  either  shock  waves  (nonlinear  sound  waves)  or  contact- 
discontinuities  (temperature  jumps  or  slip  lines).  The  motion  of  each  type 
of  wave  is  determined  by  Rankine-Hugoniot  equations  that  relate  the  states 
on  the  two  sides  of  the  wave.  In  the  interior  regions,  i.e.  the  portions 
of  space  disjoint  from  the  curves,  the  solution  is  thought  of  as  being 
smooth  (or  relatively  smooth,  with  only  small  discontinuities).  Therefore 
the  interior  solution  can  be  calculated  well  using  finite  difference 
methods. 

Two  further  issues  remain  to  be  specified  before  the  front  tracking  scheme 
is  defined.  The  first,  which  is  the  subject  of  this  paper,  Is  the  Interac¬ 
tion  between  tracked  and  interior  waves.  The  second,  not  discussed  here, 
is  the  interaction  between  two  or  more  Intersecting  tracked  waves,  where 
the  tracked  waves  do  not  define  a  problem  that  is  locally  one-dimensional. 

2.  THE  NATURE  OF  THE  COUPLING  OF  INTERIOR  AND  TRACKED  WAVES .  During  a  time 
step  [t,t  +  At] ,  Interior  waves  may  reach  one  of  the  tracked  waves.  When 
this  occurs,  the  interior  wave  can  be  transmitted,  reflected,  and  absorbed. 
Also  the  tracked  wave  can  spontaneously  radiate  waves  into  the  interior. 
(For  example,  curvature  of  the  front  causes  this.)  Within  the  tracked 
wave,  there  Is  both  normal  and  tangential  motion.  The  normal  motion  is 
specified  by  the  Rankine-Huguniot  equations.  The  tangential  motion 
corresponds  to  surface  waves  and  curvature  effects. 
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For  computational  simplicity ,  we  wish  to  decouple  the  front  and  interior 
calculations,  as  much  as  it  is  scientifically  correct  to  do  so.  The  extent 
of  the  front-interior  coupling  is  determined  by  the  domain  of  dependence  of 
the  moving  front.  Stated  differently,  we  need  to  know  all  waves  within  a 
distance 


x  -  0(AO 

of  the  front  in  order  to  propagate  the  front  through  the  time  interval  At, 
To  accomplish  this,  we  introduce  local  normal  and  tangential  coordinates, 
and  then  use  a  fractional  step  (operator  splitting)  method  to  advance  the 
front,  first  normally  and  then  tangentially. 

3_.  THE  DATA  STRUCTURE.  The  data  structure  of  the  computation  consists  of 
state  variables  (density,  etc.)  specified  at  points  of  a  regular  rectangu¬ 
lar  grid,  together  with  double-valued  state  variables  (left  density,  right 
density,  etc.)  specified  at  irregular  mesh  points  on  a  moving  front.  To 
define  the  state  variables  at  an  arbitrary  point  x  we  interpolate  between 
these  regular  and  irregular  points.  Because  the  front  divides  the  plane 
into  connected  components,  it  is  important  to  interpolate  only  data 
corresponding  to  the  component  to  which  x  belongs;  the  following  procedure 
is  used. 

First  the  regular  mesh  block  containing  x  is  determined.  If  all  four 
corners  have  the  same  component  as  x  then  the  state  at  x  is  given  by  bil¬ 
inear  interpolation.  If  three  or  fewer  corners  have  the  same  component  as 
x  then  the  mesh  block  is  triangulated,  the  triangle  in  which  x  is  located 
is  found,  and  the  state  at  x  is  calculated  by  linear  interpolation  from  the 
states  at  the  corners  of  this  triangle.  The  triangulation  is  chosen  so 
that  each  triangle  lies  entirely  within  a  single  component, 

THE  NORMAL  SWEEP  TO  PROPAGATE  THE  FRONT.  At  each  mesh  point  on  the 
front,  a  normal  direction  ft  is  defined.  The  solution  at  time  t  is 
evaluated  at  this  mesh  point  (where  it  is  double  valued,  so  that  two 
states,  left  and  right,  are  obtained).  The  solution  at  time  t  is  also 
evaluated  at  a  normal  distance  Ax  on  each  side  of  the  front.  These  states 
will  be  used  to  calculate  the  waves  that  impinge  on  the  front  from  the 

interior.  Notice  that  if  the  front  contains  curves  too  close  to  each 

other,  these  new  evaluation  points  x  +  Ax  *  n  may  be  in  different  com¬ 
ponents  from  the  mesh  point  x  +  0  *  n  at  the  front.  In  this  case  the 

evaluation  point  x  +  ax  *  n  is  shifted  into  the  correct  component.  (Con¬ 
ceptually,  the  state  at  a  point  outside  a  given  component  is  obtained  by 
extrapolation. )  Thus  we  always  have  as  data  two  left  states  corresponding 
to  one  component  and  two  right  states  corresponding  to  a  second  component. 

These  states  are  used  as  initial  data  for  an  extended  or  non-local  Rieraann 
problem.  Higher  order  solutions  of  these  Riemann  problems  have  been  dis¬ 
cussed  before.  To  reduce  sampling  error  in  the  random  choice  method,  a 
steady  state  Ansatz  has  been  used [4, 2]  to  extend  local  Riemann  data  over  a 
mesh  block,  thereby  obtaining  a  non-local  Riemann  problem,  which  is  solved 
to  higher  order.  Higher  order  Godunov  schemes [1]  also  employ  Ideas  related 
to  solutions  of  Riemann  problems.  In  the  present  scheme  we  solved  the 
non-local  Riemann  problem  as  follows. 
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Using  the  left  and  right  states  located  at  the  front  we  solve  an  ordinary 
Riemann  problem*  The  solution  is  the  correct  answer  to  the  non-local  prob¬ 
lem  at  time  t  +  0,  so  it  is  used  to  approximate  the  propagation  speeds  of 
the  characteristics  that  enter  the  two  sides  of  the  tracked  wave*  By  fol¬ 
lowing  these  characteristics  backward  from  t  +  A x.  to  t  we  find  their  start¬ 
ing  points  in  the  normal  intervals 

[x  +  0  *  n,  x  +  |Ax|  *  n] . 

The  states  at  these  points  are  calculated  using  linear  interpolation 
between  the  states  on  the  front  and  the  states  at  the  normally  displaced 
points.  In  this  way  we  determine  which  waves  from  the  normal  intervals 
enter  the  front.  Using  differential  equations  in  characteristic  form  we 
compute  corrected  left  and  right  states  on  the  front  at  time  t  +  At  -  0. 
These  corrected  states  define  a  new  Riemann  problem,  whose  solution  gives 
states  to  be  associated  with  the  propagated  front  at  time  t  +  At. 

5_.  THE  TANGENTIAL  SWEEP  ALONG  THE  FRONT.  By  linear  interpolation,  the  dou¬ 
bled  valued  variables  can  be  defined  at  all  the  points  of  the  front. 
Therefore  we  can  evaluate  the  normally  propagated  solution  at  each  mesh 
point  and  at  two  neighboring  points  displaced  a  distance  Lx  along  the 
front.  Using  these  three  stencil  points  and  the  one-dimensional  Lax- 
Wendroff  scheme  we  determine  the  tangentially  propagated  state  variables. 
Notice  that  tangential  propagation  of  the  points  on  the  front  is  equivalent 
to  a  remeshing  of  the  front,  in  the  limit  <-x  ->  0,  so  it  is  not  essential 
to  move  these  points  during  the  tangential  sweep. 

6_.  THE  INTERIOR  SCHEME  NEAR  TRACKED  WAVES .  For  the  calculation  of  the 
solution  in  the  interior  regions  we  use  the  two-dimensional  Lax-Wendroff 
scheme,  which  involves  two  half  steps.  To  facilitate  the  coupling  of  the 
front  and  interior,  the  front  is  also  advanced  in  half  steps,  so  Its  posi¬ 
tion  and  state  variables  are  known  at  t  +  At/2. 

Since  the  Lax-Wendroff  scheme  Is  a  leapfrog  composition  of  two  Lax- 
Friedrichs  steps  it  is  enough  to  describe  the  Lax-Friedrichs  scheme.  This 
scheme  usually  assumes  the  initial  data  to  be  known  at  the  four  corners  of 
a  square.  In  our  application,  however,  there  are  irregular  squares,  i.e. 
ones  for  which  one  or  more  of  its  corners  is  cut  off  by  the  front  and  thus 
lies  in  the  wrong  component.  To  circumvent  this  difficulty  we  view  the 
Lax-Friedrichs  scheme  as  defined  by  a  flux  balance:  the  sum  of  the  fluxes 
through  the  sides  of  a  mesh  block  determines  the  change  in  time  of  a  con¬ 
served  quantity  integrated  over  the  block.  From  this  viewpoint  the  Lax- 
Friedrichs  scheme  is  defined  for  irregular  squares  as  well  as  regular 
squares.  In  fact,  the  propagation  of  the  front  also  determines  the  fluxes 
through  the  front  and  thereby  through  the  sides  of  the  irregular  squares, 

7^.  TESTS  AND  VALIDATION.  The  present  implementation  of  the  above  scheme  is 
incomplete,  with  the  coupling  of  the  interior  scheme  to  the  tracked  waves 
(Sec.  5)  replaced  by  a  more  primitive  version.  Full  implementation  and 
validation  represents  work  in  progress.  Here  we  show  preliminary  results 
for  the  supersonic  flow  past  a  wedge  obstacle  in  a  channel,  using  for  com¬ 
parison  the  steady-state  solution  obtained  using  the  method  of  characteris¬ 
tics  [5].  In  this  flow  a  bowshock  Interacts  with  a  Prandtl-Meyer  expansion. 
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The  test  flow  was  simulated  using  a  50  by  50  grid  for  200  time  steps.  (The 
stability  requirements  for  the  finite  difference  schemes  dictate  that  the 
downstream  signal  speed  be  approximately  .5  grid  blocks  per  time  step.) 

The  initial  data  was  a  slight  perturbation  of  the  steady  state  solution 
obtained  using  the  method  of  characteristics.  In  Figs,  la  and  lb  we  show 
the  initial  and  final  shock  positions  and  isopycnic  (constant  density)  con¬ 
tours.  In  Figs.  2a  and  2b  we  show  the  initial  and  final  density  distribu¬ 
tions  along  two  sides  of  the  shock.  Figs.  3a, b  and  4a, b  show  the  analogous 
distributions  along  the  wedge  and  along  the  portion  of  the  exit  below  the 
shock,  respectively.  These  figures  indicate  that  the  front  tracking  scheme 
accurately  reproduces  the  steady  state  solution. 
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A  condition  on  the  numerical  flux  for  approximations  to  scalar,  non- 
convex  conservation  laws  is  introduced,  and  shown  to  guarantee  convergence 
to  the  correct  physical  solution.  These  considerations  lead  to  a  simple, 
closed  form,  analytic  expression  for  the  solution  to  the  Riemann  problem 
for  scalar,  nonconve^q  conservation  laws. 

A  systematic  approach  is  presented  for  converting  these  first  order 
accurate  convergent  approximations  to  second  order  accurate,  variation 
diminishing,  entropy  condition,  satisfying  approximations.  The  technique 
is  extended  to  systems  of  equations  of  inviscid,  compressible  flow  in 
general  geometries,  using' a  high  resolution  version  of  the  author's  scheme 
The  results  of  such  multidimensional  calculations  are  given  in  [2]. 

The  work  described  in  the  second  paragraph  is  joint  with  S. 

Chakra varthy  [2] . 
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For  a  single  scalar  conservation  law 
(1)  u.  +  f(u)  =0,  0  <  t  <  “  3  x  <  « 

"C  X 

for  general  nonconvex  f(u)  we  consider  the  Riemann  problem. 

u(x?0)  s  uL  5  x  <  0 

=  uR  ,  x  >  0 


uR  arbitrary  constants. 

It  is  well  known  e.g.  [3]  that  there  exists  a  unique  solution  to  (l) 
of  the  form 

u(x,t)  =  f  )  =  u(c) 

T  R 

taking  on  values  between  u  and  u  and  satisfying  the  entropy 
condition. 

We  present  here  a  closed  form  expression  for  the  solution  via: 

L  R 

Lemma  1.  if  u  <  u  ,  then 

(2)  (a)  f(u(c))  -  £u(£)  =  [f(w)-£v] 

we[u  ,u  ] 

..  L  R 
if  u  >  u  3 

then 

(b)  f(u(c»  -£u(£)  =  [f(w)-£^. 

we[u  ,u  J 
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Theorem  1. 


(3)  (a)  If  u^  <  uR,  then 


uU)  (  min  [f(w)-Cw] 

&  r  L  Rn 

\W€[u  }U  ] 


(b)  If  uL  >  uR,  then 


u(C)=-d7f  ^  [f(w)-£w] 

Q  r  R  L, 

\we[u  ,u  J 


For  proofs,  see  [2],  In  that  reference,  we  also  obtain  a  simple 
condition  on  the  numerical  flux  for  semi -discrete  approximations  to  (l), 
which  guarantees  convergence  to  the  correct  physical  solution.  This 
seems  to  be  the  most  general  class  of  convergent  schemes  known. 


BIBLIOGRAPHY 


[1]  S.  OSHER,  Riemann  Solvers,  the  entropy  condition,  and  difference 
approximations.  SINTJM  (to  appear). 

[2]  S.  OSHER,  and  S.  CHAKRAVARTHY ,  High  resolution  schemes  and  the 
entropy  condition,  (submitted  to  SINUM) 

[3]  P.D.  LAX,  Shock  waves  and  entropy,  in  ’’Contributions  to  nonlinear 
functional  analysis ",  E.H.  Zarontonello,  Editor,  Acad.  Press, 

New  York  (1971),  pp.  603-63^. 


257 


ARTIFICIAL  MASS  CONCEPT  AND 
TRANSONIC  VISCOUS  FLOW  EQUATION 


George  S.  Dulikravich 
Department  of  Aerospace  Engineering 
and  Engineering  Mechanics 
University  of  Texas  at  Austin 
Austin,  Texas  78712 


Peter  Niederdrenk 
Institut  FUr  Theoretische 
Strbmungsmechanik 
DFVLR-AVA 

Gdttingen,  F.R,  Germany 


ABSTRACT .  By  varying  the  grid  clustering  on  the  surface  of  an 
airfoil,  it  was  observed  that  symmetric  shocked  solutions  develop  with  a 
nonunique  shock  strength  and  location  when  numerically  solving  the  full 
potential  equation. 

It  is  shown  analytically  that  the  conventional  form  of  artificial 
density  (or  viscosity)  produces  a  number  of  truly  nonlinear  terms  which 
are  suspected  to  be  the-  cause  of  the  nonuniqueness  for  all  the  finite 
grid  sizes.  A  concept  of  artificial  mass  flow  is  shown  to  be  suitable 
for  analytically  evaluating  a  new  exact  form  of  the  switching  function 
that  eliminates  all  the  nonlinear  terms  for  any  value  of  the  local  Mach 
number.  The  resulting  expanded  full  potential  equation  then  becomes  a 
third  order  partial  differential  equation  of  permanently  parabolic  type 
resembling  Sichel's  transonic  viscous  flow  equation.  Consequently,  our 
expanded  full  potential  equation  does  not  require  the  introduction  of  the 
customarily  used  artificial  time  concept. 


I.  INTRODUCTION.  The  numerical  techniques  for  solving  full 
potential  equation  modelling  transonic  flows  are  presently  based  on  two 
similar  concepts:  an  explicitly  added  artificial  viscosity  [1]  and  an 
implicitly  modified  artificial  density  [2].  Both  techniques  should 
create  additional  terms  in  the  full  potential  equation  in  such  a  way  that 
they  nullify  the  numerical  error  introduced  when  using  upstream  rotated 
finite  differencing  in  the  locally  supersonic  regions. 

If  these  artificial  dissipative  terms  are  introduced  in  a  divergence 
free  form,  it:  was  believed  that  the  numerical  solution  will  be  unique 
[3].  Nevertheless,  in  recent  years  it  was  observed  that  the  question  of 
uniqueness  is  not  resolved  when  isentropic  discontinuities  (shocks)  are 
present  in  the  solution.  It  has  been  shown  that  the  finite  difference 
formulation  [4]  of  the  artificial  viscosity  can  somewhat  affect  the 
location  and  the  strength  of  the  isentropic  discontinuities  without 
causing  numerical  instability  problems.  The  artificial  damping  was 
monotonically  introduced  in  conservative  form  across  the  sonic  line  at 
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all  subsonic  parts  of  the  flow  domain  where  the  local  value  of  the  Mach 
number  exceeded  an  arbitrary  prescribed  cut-off  value  [5]*  A  disturbing 
conclusion  was  that  the  numerical  solution  depends  upon  the  choice  of  the 
initial  guess  for  the  potential  field,  varied  choices  for  which  causing 
the  numerical  solution  to  converge  to  strikingly  different  answers  [5]  if 
shocks  are  present  in  the  field.  At  the  same  time  it  has  been  demon¬ 
strated  that  the  nonunique  solutions  are  not  dependent  on  the  type  of  the 
grid  used  nor  are  they  dependent  on  the  grid  resolution  for  the  grid 
sizes  commonly  used.  The  conclusion  was  that  these  were  the  correct 
nonunique  solutions  of  the  exact  full  potential  equation. 

We  numerically  experimented  with  a  number  of  two-dimensional  full 
potential  cascade  codes  [6,7,8]  and  consistently  observed  non-unique 
symmetric  shocked  solutions.  The  test  case  was  a  nonstaggered  cascade  of 
NACA0012  airfoils  spaced  at  3.6  chord  lengths  apart.  The  free  stream 
Mach  number  was  M  =  0.8  ,  and  the  free  stream  angle  of  attack  was  zero. 
First  order  artificial  viscosity  in  a  fully  conservative  form  [3,9]  was 
used  in  each  of  the  codes.  Numerical  solutions  were  obtained  on  a 
sequence  of  four  successively  refined  0-type  and  C-type  non-orthogonal 
boundary  fitting  geometrically  periodic  grids.  -When  the  grid  points  on 
the  airfoils  surface  were  equidistantly  spaced,  the  iterative  procedure 
converged  to  a  symmetric  solution  with  the  shock  located  at  approximately 
80  percent  of  the  chord. 

Then  the  same  sequence  of  grids  with  the  same  number  of  grid  points 
was  used  with  the  only  exception  that  the  grid  points  on  the  airfoil's 
surface  were  symmetrically  clustered  closer  to  the  leading  edge  and  the 
trailing  edge.  The  result  was  a  symmetric  solution  with  the  shocks 
located  at  approximately  72  percent  of  the  chord.  It  should  be  noted 
that  the  boundary  and  the  periodicity  conditions  were  enforced  explicitly 
and  exactly  and  that  the  airfoil  surface  Mach  number  drop  across  the 
shocks  was  in  both  test  cases  in  close  agreement  with  the  exact  one¬ 
dimensional  isentropic  shock  conditions.  These  results  were  obtained 
with  both  0-type  and  C-type  grids  while  using  the  same  (symmetric) 
initial  guess  for  the  potential  field,  the  same  number  of  iterations  and 
the  same  relaxation  factors  on  each  of  the  grids. 


II.  ANALYSIS  OF  ARTIFICIAL  DENSITY  CONCEPT.  The  following  deriva¬ 
tions  should  suggest  the  probable  cause  for  nonuniqueness  of  the  shocked 
solution  of  the  discretized  form  of  the  full  potential  equation. 

Mass  conservation  equation 

V.(p?*)  =  (£Ss  +^en)-UtS?s  +  D^nen)  =0  (1) 


260 


where  comma  denotes  partial  differentiation,  can  be  written  in  its 
canonical  form  [10] 


V’(pV<*>)  =  p[(l-ir)4>jSS  +  ^nn]  =  0 


(2) 


If  all  the  flow  variables  are  nondimensional ized  with  their  respective 
critical  properties  then 


111  .  -  Ml  M 

P  M*  "*,S 


(3) 


If  an  artificial  density  of  the  general  form 


p  =  P  -  As  up 


(4) 


is  introduced,  it  can  be  shown  that 


1 


c  P  1  +  as  u  M, 


2 

tC 


[4S  u  +  As(u  ^)>s  M*($]  (5) 


Hence,  the  mass  conservation  equation  becomes 


V- (pV<t) )  =  p{[1-M2)<J>  +  *  J  +  As  uM2  <)1  +  E}  =  o 

a 'III  ,SSS 


(6) 


where 


E  = 


AS  M* 


M2i2 


M 


1  +Asv  k:\ss 


2 - C(w  M^.s  *  (u  K]  isM*,ss] 


(7) 


261 


Then  the  undesirable  term  in  the  expanded  full  potential  equation  is 


1  +As^,*,ss 


[-is  u2  g-*  *>sss 


+  wtCl+Y  )  V  1) 


,S  ,SS‘ 


The  switching  function  y  is  customarily  [1*2]  assigned  the  value 


u  “  WJH  =  1  *  ^ 


which  is  obtained  from  the  condition  that  the  term 


AS  u  M  <J> 


should  be  approximately  the  same  magnitude  as  the  terms  introduced  by  the 
upstream  differencing  of  <p  _c  in  the  locally  supersonic  regions  of  the 

flow  field.  If  y^  is  used  in  E  ,  the  result  is  a  cluster  of  truly 

nonlinear  terms 


- As  \  —  [W-lHl-  -V) 


1  M2  cJ2  (<t>  5JZi 

*  71((Ut  ’  <77 '  “nr  *  °*Y  ’ 
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It  is  obvious  that  the  conventional  value  of  y  fails  to  make 
E  =  0  for  any  value  of  local  flow  Mach  number  equal  to  or  greater  than 
one.  Actually,  when  using  yJH  one  ends  up  solving  a  nonlinear  par* 

tial  differential  equation  even  on  the  sonic  line  where  eq.  6  reduces  to 

<P  nn  +  As(l  +  y  )($  )2  =  0  (12) 

9  nn 


The  substantiated  explanation  of  the  influence  of  the  nonlinear 
terns  on  the  final  solution  of  the  expanded  full  potential  equation  is 
not  available  at  the  present  time.  Nevertheless,  it  could  be  speculated 
that  these  nonlinear  terms  have  some  properties  of  solitons  thus  causing 
the  numerically  observed  non-unique  solutions  for  all  realistic  non-zero 
grid  sizes.  The  influence  of  the  nonlinear  terms  can  be  certainly  affec¬ 
ted  by  the  particular  finite  differencing  applied  in  the  evaluation  of 
the  derivatives  constituting  the  artificial  viscosity  [4]  or  artificial 
density  [11].  Different  shocked  solutions  can  also  be  achieved  by  vary¬ 
ing  the  expression  for  the  switching  function  y  [12,13]. 

Full  potential  equation  does  not  involve  any  dissipative  mechanism 
on  the  basis  of  which  expansion  discontinuities  could  be  eliminated.  The 
numerically  created  artificial  viscosity  [1]  and  density  [2]  terms  cou¬ 
pled  with  an  artificial  time  concept  [1]  represented  the  basis  of  almost 
all  transonic  potential  flow  computations  performed  over  the  past  decade. 

By  allowing  for  an  insignificant  vorticity  generation  in  a  limiting 
process  applied  to  a  small  disturbance  transonic  potential  equation, 
Sichel  [14]  has  derived  a  "viscous  transonic  equation." 


+  (K  -  *  )d>  +  A  *  0 

V  %xxx  v  “  ^,x'v,xx  v  ,yy 


(13) 


where 

Kv  =  o  +  ^)/[T(l+Y)Mf]Z/3Re  (14) 

and 

K_  =  0  -  Mf)/[T(l  +  Y)Mf]2/3  05) 

Here,  M  is  the  free  stream  Mach  number,  y  is  the  ratio  of  specific 

heats,  Pr  and  Re  are  Prandtl  and  Reynolds  numbers,  respectively,  and 
x  is  half  the  airfoil  thickness  ratio. 
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Since  Ky  >  0  this  equation  is  parabolic  and  Chin  [15,16,17] 

solved  it  numerically  without  any  need  to  introduce  the  artificial  time. 
Although  it  is  better  suited  for  modelling  the  physical  details  at  the 
shocks,  eq.  13  cannot  be  recast  in  a  divergence  free  form. 


III.  ARTIFICIAL  MASS  FLUX  CONCEPT.  It  is  possible,  neverthe¬ 
less,  to  derive  an  expanded  full  potential  equation  that  will  always  be 
of  a  parabolic  type  and  will  be  readily  expressible  in  a  divergence  free 
form.  The  idea  is  to  expand  the  mass  flux  (rather  than  density  or  the 
speed  of  sound  [2]  alone)  in  a  Taylor  series.  Such  a  modified  mass 
conservation  equation 


V-(pV<j>)  =  0  (16) 


can  be  expressed  in  the  locally  streamline  aligned  orthogonal  coordinate 
system  (s,n)  as 


(  r  ®s  +  f  ®nK(p*,s  "  *s  v(p*,s)is)es  + 
s  n 


(p<J>  n)en^  “  0  (^) 


Hence, 


o[(  1-M2)*)SS  +4,nn]  -  AS  p[U)S(*jSS 


+  u( 


+  <j> 


0 


(18) 


Because 


d  =  M* 

,S  * 


(19) 
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It  follows  that  the  artificial  mass  produces 


p{[(l-M2)4  „  +  ♦  J  +  utM2-!  )♦,«*> 


,ss  ,nn 

-  ds  p{(i-m2)u  *>ss  -  p[(1+Y  )  3  +  S:  '  iC]U,ss)2}  =  0 

(20) 


The  second  brace  contains  undesirable  nonlinear  terms.  The  value  of  the 
switching  function  p  is  determined  in  such  a  way  as  to  eliminate  them 
entirely.  Hence, 


du 

V 


CmS-U  (^y-  -  ^ 


(M*-D 


,  m; 
+  (fir)  — 


d  M, 


(MM)  0^-^-m2) 


(21) 


The  result  is 


V  * 


Zzi 

1  f ,2\y-l 


(m2-D 


(a  ) 


(22) 


Because  of  the  relation 


,  (Ill, (.2)-'  (23) 

fC-1  . L 


it  follows  that  the  modified  mass  conservation  becomes 


p{[(l-M  )$ 


»ss 


d)  ]  +  AS  ("^2 — )  ("fl)  ^  eec 

,nn  ^  p  ,sss 


}  =  0 


(24) 
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This  expanded  full  potential  equation  is  of  parabolic  type  for  any 
value  of  Mach  number  and  its  variable  diffusion  coefficient  does  not 
vanish  for  any  finite  value  of  the  Mach  number.  Note  the  striking 
similarity  between  eq.  24  and  eq.  13  and  the  fact  that  eq.  24  can  be 
integrated  without  a  need  for  artificial  time  variable  [1].  The  diver¬ 
gence  free  form  of  eq.  24  can  be  achieved  as  follows. 

Let 


A  =  p<f>  -  AS  u(p$  _)  e 

,5  ,5 


(25) 


Hence, 


A 


$  [p-AS  u(1 
»  •> 


(26) 


and  the  modified  mass  conservation  (eq.  16)  can  be  expressed  in  its  fully 
conservative  form  as 


v*(5v<fi)  =  (p*  )  +  (p«t  J  u  =  o 

jX  ja  ,y  *y 


(27) 


where  the  exact  form  of  the  artificial  density  is 


(28) 


or 


=  p  -  As(^i)  \  (i)pi 


M 


2  V^S 


(29) 


IV.  SUMMARY.  It  has  been  analytically  proven  that  the  usual 
formulation  of  artificial  density  and  viscosity  terms  leads  to  an  intro¬ 
duction  of  truly  nonlinear  terms  whose  effects  are  suspected  of  causing 
certain  numerical  errors  and  inconsistencies  in  the  numerical  computation 
of  transonic  potential  flows.  A  new  concept  of  artificial  mass  flux  was 
shown  to  produce  only  linear  artificial  dissipation  that  has  the  same 
basic  character  as  a  governing  equation  for  physically  viscous  transonic 
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flow.  The  artificial  mass  can  be  easily  reformulated  in  terms  of  a  new 
artificial  density  in  a  fully  conservative  form. 
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DIAGNOSTIC  ALGORITHMS  FOR  CONTOUR  DYNAMICS 
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Pittsburgh,  PA  15261 


ABSTRACT .  The  goal  of  large  scale  numerical  simulations  (numerical 
experiments)  is  to  obtain  a  quantitative  understanding  of  complicated 
nonlinear  dynamical  processes.  A  proper  picture  or  graph  can  spark  in¬ 
sights  into  new  mathematical  or  physical  processes  and  liberate  us  from 
the  prejudices  of  our  conservative  intuitions.  Diagnostic  algorithms 
and  their  graphs  are  particularly  useful  in  the  contour  dynamics  model 
for  studying  two  dimensional  fluid  dynamics.  This  is  because  the  2D 
densities  are  replaced  by  contours  bounding  piecewise-constant  density 
regions  (i.e.,  ID  curves).  Thus  our  diagnostic  parameters  are  functions 
of  one  variable,  the  arc  length  along  each  curve,  and  their  graphs  are 
2D.  We  discuss  and  illustrate  some  time  dependent  properties  of  planar 
curves,  including  the  spatial  plot,  low  order  moments,  perimeter, 
curvature,  and  Fourier  transforms.  We  will  also  apply  these  techniques 
to  contours  obtained  from  finite-difference  representations  of  con¬ 
tinuum  systems, 

I.  INTRODUCTION.  The  method  of  contour  dynamics  is  ideally 
suited  to  treating  the  dynamics  of  incompressible-inviscid  fluids  in 
two  dimensions.  For  example,  Longuet-Higgins  and  Cokelet  [1,2]  have 
studied  incompressible  shallow  and  deep  water  waves  on  boundaries 
between  regions  where  the  density  is  piecewise-constant.  Zabusky, 
Hughes,  Roberts,  Deem,  Overman,  and  Wu  [3, 4, 5, 6]  have  investigated  the 
Euler  equations  with  piecewise-constant  finite-area-vortex-regions 
(FAVRs)...  Zabusky  and  Overman  [7]  have  studied  how  to  model  surface 
tension, and  dissipation  using  contour  dynamics.  Finally,  Overman, 
Zabusky,  and  Ossakow  [8]  have  been  studying  the  evolution  of  a 
piecewise-constant,  weakly-ionized  and  strongly  magnetized  plasma  in  an 
electric  field,  a  problem  mathematically  analogous  to  the  Buckley- 
Leverett  equations  of  flow  in  a  porous  media  [9]. 

In  this  paper  we  discuss  how  diagnostics  help  to  quantify  and  to 
gain  insight  into  the  time  evolution  of  two  very  different  fluid 
dynamical  problems,  namely  the  Euler  equations  and  ionospheric  plasma 
clouds.  In  Section  II  we  discuss  the  two  mathematical  models  and  in 
Section  III  how  they  are  discretized,  including  how  we  obtain  the 
contour  representations.  In  Section  IV  we  will  use  various  diagnostics 
and  their  graphs  to  study  specific  examples  and  in  Section  V  we  will 
show  how  these  methods  can  be  applied  to  continuum  models. 

II.  MATHEMATICAL  MODELS.  The  Euler  equations  can  be  written  in 
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If  Green's  theorem  is  applied  to  the  result  of  substituting  (2)  into 
(lc)  we  obtain  an  expression  for  the  velocity  as  a  sum  over  the  N 

c 

contour  integrals,  namely 

N 

-1  c 

(u,v)  =  (u(x,y)  ,v(x,y) )  -  (2tt)  £  [u>].  log  l  (d£ ,  dn ) ,  (4) 

j=l  3  BD. 

J 

where  [a)]  .  is  the  jump  in  vorticity  (outside-inside)  at  3d.  and 
3  3 

where  the  dependence  on  time  has  been  suppressed.  Alternately,  if  we 
integrate  by  parts  we  obtain 

1 

(u,v)  =  (,2tt  )  ” 1  l  [co]  [  r1(.x-^,y-n)d£.  (5) 

j=l  J  J3D. 

J 

The  contours  are  advected  by  (d/dt)(.x,y)  =  (u,v). 
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The  equations  of  motion  of  the  ionospheric  plasma  cloud  is  [8] 


V  •  (NVc}>)  =  0, 

(6a) 

3fcN  +  v-VN  =  vV2N, 

(6b) 

v  =  (-e  3  +  e  3  )<J>, 

x  y  y  x 

(6c) 

where  N  is  the  density  of  ions,  E  =  — Vt^  is  the  electric  field  and 

E  E  e*  as  I  (x,y)  I  -»■  and  V  is  the  dissipation  parameter 
ox11 

arising  from  ion-neutral  atom  collisions.  A  two-dimensional  problem 
arises  because  we  assume  that  the  ions  move  orthogonally  to  the 
magnetic  field  B^e^  which  is  assumed  to  be  large  and  unaffected  by 

the  cloud. 


In  the  contour  dynamical  representation  we  assume  that  N  is 
piecewise-constant  and  follow  the  evolution  of  smooth  contours  3D 
at  which  N  is  discontinuous.  In  the  simplest  case  N  takes  on 
only  two  constant  values,  i.e.. 


N(x,y , t) 


N  for  (x,y)  e  D(t) 


|n+  for  (x,y)  4  D(t) 


(7) 


where  D(t)  is  a  bounded,  simply  connected  region  in  IR  .  Using  the 
single-  and  double-layer  Green's  function  of  Laplace's  equation,  we 
find  that  [8] 


3  <Kp) 

n 


C1  EoX(p) 


+  c„ 


3D 


(f)(s)3  In  r(p,s)ds 


C1  Vnx(p)  +  C2 


3  4>(s)3  In  r(p,s)ds 
n._  n  n 
3D  p 


(8) 


where 

c±  =  -2N+/ (N_+N+)  ,  c2  =  -(l/lT)(N_-N+)/(N_+N+) 

e  3D,  r(p,s)  is  the  straight  line  distance  between 

3  and  3  are  the  derivatives  in  the  direction 
n  n 

s  p 

normal  to  the  boundary  at  s  and  p,  respectively, 
obtain  the  electric  field  on  3D.  We  then  advect  3D 


and  where  s,p 
p  and  s ,  and 

of  the  outward 
Using  Eq.  8  we 
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by  [8] 


jjr  (x,y)  =  (~3y<Mx4>)  +  v3gg  (x,y)  (9) 

where  s  is  the  arc  length  on  3D. 

III.  DIAGNOSTICS.  To  discretize  the  contour,  {(x,y)  £  3d},  or 
contours  we  use  a  finite  number  of  nodes,  {(x^,y_.)  |  1  j<  j  <_  N} ,  and 

follow  the  time  evolution  of  these  nodes.  For  the  Euler  equations  we 
use  the  midpoint  method  to  evaluate  the  integral  in  Eq,  5  and 
predictor-corrector  to  advect  the  nodes  [6],  For  the  plasma  cloud 
[8]  we  discretize  Eqs.  (8)  by  the  trapezoidal  method  and  use  Gauss- 
Seidel  to  solve  the  resulting  system  of  equations  for  the  electric 
field  at  each  node.  The  points  are  then  advected,  Eq.  9,  using 
implicit  predictor-corrector. 

To  obtain  a  continuum  representation  from  the  finite  number  of 
nodes  we  use  cubic  splines.  That  is  [7],  we  calculate  the  straight 
line  distance  between  adjacent  nodes. 

As.  =  ((x.+1-x.)2  +  (yj+i-yp2)172, 

and  then  calculate  the  cubic  spline  representations  for 

{ (s .  ,x. )  |  1  _<  j  <  N}  and  {  (s  „,y  )  |  1  <_  j  <_  N}.  This  continuum  re- 
3  J  1  j 

presentation  can  then  be  used  to  [8]  adjust  points  on  the  contour, 
calculate  derivatives  (e.g.,  to  calculate  the  tangent  angle  and 
curvature),  and  calculate  integrals  (e.g.,  the  area  and  center  of 
mass) . 


IV.  DIAGNOSTICS.  In  our  study  of  the  Euler  equations  one  area 
of  particular  interest  is  steady-state  solutions  (V-states)  and  their 
stability.  For  example  in  [5b]  we  investigated  the  stability  of  a 
pair  of  rotating  symmetric  FAVRs  (see  Fig.  2).  In  Fig.  1  we  show 
diagnostics  for  a  V-state  (calculated  numerically)  which  was  run  for 
125  units  of  time  (slightly  more  than  two  revolutions).  We  show  the 
area  change,  AA/A,  the  perimeter  change,  AP/P,  the  change  in  the 
distance  between  the  center  of  rotation  and  the  center  of  area,  Ax/x, 
and  the  change  in  maximum  curvature,  Ak/k,  for  one  of  the  contours. 
These  diagnostics  provide  strong  support  that  we  have  indeed  found  a 
steady-state  solution.  To  investigate  its  stability  we  moved  the 
contours  away  from  the  origin  by  10%.  In  Fig.  2  we  show  the  resulting 
spatial  and  curvature  plots  for  nearly  three  revolutions.  In  Fig.  3 
we  show  the  corresponding  diagnostics.  Note  that  the  perimeter,  x, 
and  k  changes  are  a  factor  of  10  larger  than  in  the  steady-state 
case.  Since  the  perimeter  and  x  changes  show  no  monotonic  growth 
we  conclude,  numerically,  that  this  configuration  is  stable.  We 
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also  moved  the  contours  in  toward  the  origin  by  2.5%  from  the  steady- 
state  solution  and  show  the  merger  of  the  two  FAVRs  in  Figs.  4  and  5. 
In  the  diagnostics  we  have  replaced  Ak/k  by  In  |Ax/x|  to  show  the 
rate  of  approach  of  the  two  contours  more  clearly.  By  use  of  these 
diagnostics  we  have  numerically  verified  a  conjecture  by  Saffman  and 
Szeto  [10]  of  the  transition  point  between  stability  and  instability 
for  symmetric  corotating  V-states. 

In  our  study  of  the  plasma  clouds  there  is  an  additional  diffi¬ 
culty  which  must  be  considered.  Namely,  without  dissipation  (i.e., 

V  in  Eq.  9)  the  contour  dynamics  model  seems  to  be  unstable  or  ill- 
posed  [8].  (We  believe  the  same  is  true  of  the  continuum  model, 

Eqs.  6.)  Thus  it  is  possible  for  round-off  errors  to  cause  large 
numerical  errors  in  a  very  short  time  unless  extreme  care  is  taken. 
(This  difficulty  does  not  seem  to  occur  in  the  Euler  equations  where 
it  seems  to  be  the  low  modes  that  grow  in  merger  or  fission  [5]  and 
there  does  not  seem  to  be  any  high  frequency  instability.) 

One  of  the  basic  questions  we  have  addressed  in  plasma  clouds  is 
the  effect  of  one  cloud  on  another.  We  address  this  question  in  Fig. 

6  where  we  show  the  time  evolution  of  one  cloud  versus  two  clouds. 
Initially  there  is  a  0.01%  perturbation  of  the  boundary  at  the  40th 
mode.  As  our  diagnostics  we  use  the  curvature  and  the  Fourier  modes, 
i.e.,  we  let  the  radius,  r,  be  a  function  of  the  arc  length,  s,  and 
decompose  rCs)  into  its  Fourier  modes.  At  time  0  you  can  see  the 
perturbation  at  the  40th  and  also  a  small  numerical  error  at  the  first 
mode  due  to  the  fact  that  we  have  distributed  more  points  (there  are 
160  on  each  cloud)  on  the  top  of  the  contour,  where  it  will  go  un¬ 
stable,  than  on  the  bottom.  From  the  comparison  of  the  two  cases  we 
can  see  that  the  interaction  of  the  two  clouds  causes  a  large  change 
In  the  behavior  of  the  low  modes  due  to  the  clouds  trying  to  move 
away  from  one  another.  However  the  high  modes  are  hardly  affected 
and  we  see  the  same  number  of  striations  (_"f ingers")  forming  on  the 
top  of  the  cloud.  From  numerical  experiments  such  as  these  we  have 
shown  that  the  intermediate- time  structure  of  a  cloud  is  independent 
of  the  clouds  surrounding  it. 

We  have  also  used  the  curvature  diagnostic  to  validate  the 
accuracy  of  the  numerical  algorithm.  We  compared  the  evolution  of  a 
cloud  with  an  initial  perturbation  of  1,0%  at  the  40th  mode  and  two 
resolutions:  (a)  400  nodes  on  the  contour  and  a  time  step  of  0.005; 

and  (b).  560  nodes  and  a  time  step  of  0.0025  (that  is,  10  or  14  nodes 
per  period,  respectively).  In  Fig.  7  we  show  the  curvatures  at 
t  *  1,2  where  one  can  perceive  only  very  slight  differences.  To  see 
that  the  near-inflection  points  and  new  small-scale  "wiggles"  are  not 
numerical  artifacts  we  show  the  spatial  evolution  of  the  cloud  in 
Fig,  8  and  the  curvature  evolution  in  Fig.  9.  These  inflection  points 
and  "wiggles"  show  how  the  fingers  are  trying  to  align  themselves 
vertically  as  they  lengthen  and  thus  they  predict  the  spatial 
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structure  that  will  later  appear. 


V.  CONTINUUM  MODEL.  It  is  possible  to  use  these  same  kinds  of 
diagnostics  on  two-dimensional  continuum  models  by  focusing  on 
contour  lines.  We  will  restrict  our  attention  to  the  plasma  cloud 
example  where  all  the  runs  were  done  using  a  finite-difference  re¬ 
presentation  of  Eqs.  6  developed  by  S.  T.  Zalesak  at  the  Naval  Research 
Laboratory  [11].  We  warn  the  reader  that  the  length  and  time  scales 
in  the  plots  in  this  section  are  very  different  from  those  used 
previously  and,  in  addition,  the  vertical  scales  in  the  diagnostics 
in  this  section  vary  from  time  to  time. 

In  Fig.  10  at  time  0  we  show  the  initial  density  of  the  cloud  as 
a  contour  plot.  This  initial  condition  was  chosen  to  closely  resemble 
the  piecewise-constant  clouds  of  contour  dynamics  and  so  the  density 
is  nearly  constant  inside  and  outside  the  contour  lines  and  has  a  steep 
slope  to  approximate  the  jump  in  density  in  the  contour  dynamics  clouds. 
There  is  an  initial  perturbation  of  1.0%  at  the  12th  mode.  The  contour 
line  we  will  follow  is  the  one  at  the  average  density  of  the  cloud 
(that  is,  the  average  of  the  density  at  the  center  of  the  cloud  and  at 
infinity  at  time  0).  The  diagnostics  we  will  use  are  the  curvature, 
the  tangent  angle,  and  the  density  gradient  perpendicular  to  the 
contour  (i.e.,  the  maximum  density  gradient).  The  diagnostics  are  all 
smoothed  by  least  squares  interpolation  to  remove  high  frequency  noise 
since,  for  example,  the  small-scale  fluctuations  in  the  curvature  can 
be  up  to  3  times  as  large  as  the  actual  curvature  due  to  the  calculation 
of  the  contour  points  from  the  2D  mesh.  At  time  0  the  curvature  and 
tangent  angle  are  very  close  to  their  analytical  values  but  the 
gradient,  which  should  be  constant,  varies  by  20%.  This  is  due  to  the 
fact  that  a  variable  mesh  was  used  and  the  mesh  size  was  very  large 
at  the  bottom  of  the  cloud,  which  is  stable.  CThis  effect  can  also 
be  seen  in  the  curvature  at  the  bottom  of  the  cloud  at  later  times.) 

The  gradient  diagnostic  is  particularly  valuable  in  plasma  clouds 
because  it  is  the  large  gradient  which  causes  the  fingers  to  form  and, 
at  later  times,  secondary  fingers  to  split  off. 
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Diagnostics  for  the  unperturbed  V-state:  change  in  area,  AA/A  3 
(A(t)-A(O) )/A(0) ;  change  in  perimeter,  AP/P  3  (P(t)-P(O) )/P(0) ;  change 
in  center  of  mass,  Ax/x  ■=  (x(t)-x(0))/x(0) ;  and,  change  in  maximum 
curvature,  Ak/k  3  (max  |K(t)|-max  |k (0) | ) /max  |k(0)|.  (Fig.  2  of 
Ref.  5b.) 
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Figure  2 


Figure  3 


Outwardly  perturbed  V-state:  (a)  physical  Outwardly  perturbed  V-state.  The 

space;  (b)  curvature  vs.  arc  length,  plots  are  the  same  as  in  Fig.  1. 

(Fig.  3  of  Ref.  5b.)  (Fig,  4  of  Ref.  5b.) 
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Figure  4 

Inwardly  perturbed  V-state.  The  plots 
are  the  same  as  Fig,  2,  (,Fig.  5  of 
Ref .  5b . ) 
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Figure  5 

Inwardly  perturbed  V-state.  The 
plots  are  the  same  as  in  Fig.  1 
except  that  the  change  in  curvature 
has  been  replaced  by  the  logarithm 
of  the  change  in  center  of  mass. 
(Fig.  6  of  Ref.  5b.) 
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Figure  6 

Time  =0.  A  comparison  of  the  time  evolution  of  one  and  two  clouds  with  a 
0,01%  perturbation  at  the  40th  mode.  Shown  are:  physical  space,  the 
curvature  vs.  arc  length,  s  (s  =  0  is  the  bottom  of  the  cloud),  and,  the 
amplitude  of  the  Fourier  modes,  M,  of  r(s)  where  r  is  the  radius  from 


the  center  of  mass. 


2 


1(a) 


<b) 


Validation  of  the  accuracy  of  the  contour  dynamics  code.  #The  evolution 
of  a  cloud  with  an  initial  perturbation  of  1%  at  the  40th  mode  and  two 
resolutions:  (a)  400  nodes  and  At  =  0.005;  and  (b)  560  nodes  and 
At  =  0.0025.  The  curvatures  vs.  arc  length  are  shown  at  time  =  1.20. 

(s  =  0  is  the  bottom  of  the  cloud  and  s  =  maximum  is  the  top.)  (Fig. 
10  of  Ref.  8.) 
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Figure  8 

The  spatial  evolution  for  the  same  run  as  in 
Fig.  7.  The  times,  from  left  to  right,  are 
0.00,  0.60,  0.90,  1.10  and  2.30.  (Fig.  11  of 
Ref .  8 . ) 


Figure  9 


The  curvature  vs.  arc  length  corresponding  to 
Fig.  8.  (Fig.  12  of  Ref.  8.) 


the  maximum  gradient  (i.e.,  the  density  gradient  perpendicular  to  the  contour) 


vs.  arc  length. 
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ABSTRACT.  One  of  the  system  of  equations  approximating  the  large 
deflection  of  plates  consists  of  two  coupled  nonlinear  fourth  order 
partial  differential  equations,  known  as  the  von  Karman  equations.  The 
full  symmetry  group  for  the  steady  equations  is  a  finitely  generated 
Lie  group  with  ten  parameters*  For  the  time  dependent  system  the  full 
symmetry  group  is  an  infinite  parameter  Lie  group.  Several  subgroups 
of  the  full  group  are  used  to  generate  exact  solutions  of  the  time- 
independent  and  the  time-dependent  system.  These  include  the  dilatation 
group  (similar  solutions),  rotation  group,  screw  group  and  others. 
Physical  implications  and  applications  are  discussed. 


Q .  INTRODUCTION .  Perhaps  the  most  widely  applicable  method  for 
determining  analytic  solutions  of  partial  differential  equations  utilize 
the  underlying  (Lie)  group  structure.  The  mathematical  foundations  for 
the  determination  of  the  full  group  for  a  system  of  differential  equations 
can  be  found  in  Ames  [1],  Bluman  and  Cole  [2],  and  the  general  theory  is 
found  in  Ovsiannikov  [3].  The  determination  of  the  full  group  requires 
extremely  lengthy  calculations.  Detailed  calculations  can  be  found  in 
Ames  [1],  Ovsiannikov  [3]  and  for  the  Navier  Stokes  equations  in  Boisvert 
[4]  (see  also  Boisvert,  et  al.  [5]).  Here  we  give  the  results  of  those 
calculations  for  the  von  Karman  equations  (see  (0))  of  nonlinear  elasticity 
in  the  form  of  the  infinitesimal  generators  of  the  full  group.  These 
have  also  been  obtained  in  an  independent  study  by  Schwarz  [6]  using  an 
algebraic  program  package  which  uses  REDUCE  [7] ,  In  Russia  such  an 
algebraic  programming  system,  for  this  purpose,  is  available  under  the 
name  CINO  (see  Ovsiannikov  [3],  p.  57).  MACZYMA  (Roseneau  and  Schwarzmeier 
[8])  has  also  been  used  for  this  purpose  on  other  problems. 


Our  goal  is  to  obtain  explicit  invariant  solutions  to  the  system 
of  partial  differential  equations,  due  to  von  Karman, 

A^F  =  E[w^  -  w  w  ] 
xy  xx  yy 

9  a  Vi* 

A  w  =  ^-  +  i~[Fw  +  F  w  -  2F  w  ] 

D  D  yy  xx  xx  yy  xy  xyJ 


(0) 
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by  employing  various  subgroups  of  the  full  group  admitted  by  these 
equations.  To  apply  this  procedure  we  choose  a  one  (or  more)  parameter 
subgroup  and  calculate  the  general  form  of  the  subgroup  invariants. 

We  then  require  the  equations  to  be  invariant  under  this  group*  As  a 
result  a  set  of  simultaneous  algebraic  equations  arise  and  their  solu¬ 
tion,  possibly  involving  arbitrary  parameters,  leads  to  a  more  specific 
form  of  the  invariants.  Substitution  of  these  invariants  into  (0) 
results  in  a  representation  of  the  system  from  which  solutions  to  the 
original  system  can  be  constructed. 

A  preliminary  version  of  this  paper  appeared  in  [9]. 


1.  THE  EQUATIONS.  The  investigation  of  large  deflections  of  plates 
rests  on  the  solution  of  two  coupled  nonlinear  partial  differential  equa¬ 
tions  known  as  the  von  Karman  equations  [10],  Let  us  consider  a  rectangular 
elastic  plate  under  the  combined  action  of  a  uniform  lateral  load  and  a 
tensile  force  in  the  middle  plane  of  the  plate.  Denote  by  w  the  deflec¬ 
tion  of  the  plate  away  from  its  equilibrium  position  in  a  region  D  of  the 
xy  plane  and  by  F  the  Airy  stress  function.  Assuming  that  there  are  no 
body  forces  in  the  plane  of  the  plate  and  that  the  lateral  load  is  perpen¬ 
dicular  to  the  plate,  then  w  and  F  satisfy  the  equations 


~2 

d  w 


~2 
d  w 


9x  9y 


■]; 


(l.i) 


.2  q  h*  r92F  92w  .  92F  92w 

Aw  =  d  +  t[7ttt  +  -27t 

9y  9x  9x  9y 


2  2 
^F_  9  w 

9x9y  9x9y 


(1.2) 


Here  E  is  the  modulus  of  elasticity,  D  is  the  flexural  rigidity,  h*  is  the 
thickness  of  the  plate,  q  is  thelateral  load  intensity  and  A2  is  the 
biharmonic  operator,  i.e.. 


lx  lx 

a  9_ 

2  2  +  4 

9x^9yZ  9y 


Determination  of  the  stress  function  F  allows  us  to  calculate  the  stresses 
in  the  middle  surface  of  the  plate  by  means  of  the  relations 


•  _  ifl  _  32F  .  _  32F 

x  "  3y2  5  y  "  3x2  ’  "  "  3x3? 


(1.3) 


From  the  function  w,  which  defines  the  deflection  surface  of  the  plate, 
we  can  obtain  the  bending  and  shearing  stresses. 


In  our  group  analysis  of  equations  (1.1)  and  (1.2),  we  shall  be 
interested  in  the  two  cases  q  =  0  and  q  =  -p92w/9t2.  The  first  of  these 
cases  represents  the  situation  in  which  there  is  no  lateral  loading  while 
the  second  describes  the  vibration  of  a  plate  whose  deflections  are  large 
in  comparison  with  its  thickness.  *  Rather  than  treat  (1*1)  and  (1.2) 
directly,  we  introduce  the  new  variables,  x,  y,  t,  F  and  w  which  are  defined 
by 
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x  =*  mx. 


y  =  my, 


t  =  pt,  F  -  bF,  w  =  cw. 


The  choices 


,  D  ,  2  D 

b  =  h*  and  c  =  o^eT 


as  well  as 


a/2  ,  2  ,h*D.  1/2 

m  -  D  and  p  =  p(-^-)  , 

in  the  case  q  =  -p3  w/3t  ,  lead  to  the  dimensionless  sytem 


a2f  -  ($-2-  \  2 *  ^ 

W 

.2  =  £X  iOL  _l  ifl  3ji  ,  32F  32w 
"  ~  3y2  3x2  3x2  3y2  ’  3x3y  3x3y 


(1.4) 

(1.5a) 


or 


„2  32w  .  32F  32w  .  32F  32w  „  32F  32w 

3t2  3y2  3x2  3x2  3y2  "  3x3y  3x3y 


(1.5b) 


where  the  bars  have  been  dropped*  We  shall  investigate  (l,4)-(l,5a)  or 
(1.4)-(l,5b)  using  group  analytic  techniques  with  the  aim  of  obtaining 
exact  solutions  of  these  systems. 

2 .  FULL  GROUP .  The  full  (Lie)  group  of  the  time-dependent  von  Karman 
equations  (1.4,  1.5b)  is  given  by  14  infinitesimal  operators 


x  =  JL  .  x  =— ■  x  -  -2-  •  x  =  v—  -x— • 
1  3t  ,  2  3x  ’  3  3y  ’  y  x  a™  » 


3x 


3y 


v  _  2t  —  +  x  —  4-  v  —  *  X  =  —  •  x  =  t  —  ■ 

5  3t  3x  y  3y  •  o  3w  *  /  3w  ’ 


Xg  =  x 


3w  *  X9  "  y  3^  ;  X10  ^  tx  3^  5  X11  =  ty  3^  5 


(2.1) 


X12  ”  fi^x  3F  y  xx3  “  f 2 ^y  9f  ;  X14  f3^  3F  * 


Each  of  the  first  11  generators  is  associated  with  a  parameter  independent 
of  all  the  others.  These  generate  a  finite  dimensional.  Lie  algebra  L^. 
The  last  three  operators  contain  arbitrary  functions  of  time,  f^(t), 
i  =  1,2,3. 

3,  DILATATION  GROUP.  In  this  section  we  investigate  the  action  of 

the  dilatation  group  (from  X^) 
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a  -  a  ,  v 

X  =  a  X,  y  =  a  y,  (a  >  0) 


(3.1) 


on  the  dimensionless  time  independent  von  Karman  equations  (1.4)-(1.5a) . 
The  invariants  of  this  transformation  group  are  ([1]) 


n  -  x/y. 


(3.2) 


The  von  Karman  equations  are  invariant  under  the  dilatation  group  and 


f  =  f(n);  w  =  h(n). 


(3.3) 


Substitution  of  these  relations  into  equations  (1.4)-(1.5a)  leads 
to  the  following  two  coupled  ordinary  differential  equations  for  f  and  h: 


1  rn  .  2  ,  4.  df,  =  ,dh.2  . 
3  [d  +  n  +  n  )  dnl  (d11)  » 


(3.4) 


dJ  r  _l_  2  .  4.  dh,  r  d  ,  df  dh.  df  dh, 

TT  [(l  +  n  dnl  =  ~  Wn  W  +d^  W- 

dri 


(3.5) 


Several  exact  solutions  of  the  system  (3 .4) -(3 .5)  can  be  obtained. 
If  we  choose  the  deflection  to  be  constant  (i.e.,  h(f|)  =  constant),  then 
equation  (3.5)  is  satisfied  identically  while  (3.4)  becomes 


— "  [  (1  +  +  h^)  -  0. 

dn  1 


(3.6) 


Integration  of  this  equation  gives  the  invariant  solution 


f  (n)  =  yr  2  [(c^  sin  4>  Hn(z  -  2z  cos  6  +  1) 

/3  v=0  1  V  V 


+  (c^  cos  <f>^)arctan[  (z  -  cos  <J^)/sin 


(3.7) 


+  — arctan[  (4n  4-  2)/3]  +  c^, 

where  z  =  /r\ ,  4)  =  -2tt/3  +  vtt  and  the  c.  (i  =  1,...4)  are  arbitrary  con¬ 

stants. 

Another  exact  solution  can  be  determined  by  letting  h(ri)  =  cf(n) 
for  a  constant  c  4-  0*  Combining  equations  (3.4)  and  (3.5)  we  find  that 
f (n)  must  satisfy 

d  r  ,df.2,  n  ,  2,  /df v  2 

dn[n^dn)  ]  *  "(1  +  c  ) W  • 


f(n)=±^n-(c/2)+D 

c 
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4*  cD 


h(n)  -  ±  f  t,-<c2/« 

where  B  and  D  are  constants. 

We  note  here  that  under  the  dilatation  group  the  stresses  acting  in 
the  middle  plane  of  the  plate  are  related  by  the  equation  y2ay  +  x2ax  = 
2xyrx  .  In  practice,  one  is  usually  more  interested  in  these  quantities 
than  the  deflection  surface. 


Remark  2,1.  Since  the  equations  for  the  Airy  stress  function  F  and 
the  deflection  w  are  invariant  under  coordinate  translations  x  =  x  +  b 
and  y  *  y  +  d,  it  follows  that  the  von  Karman  equations  are  also  invariant 
under  the  group  given  by  T)  -  x/y,  f(n)  and  h(n). 

4,  ROTATION  GROUP.  We  now  examine  the  properties  of  the  von  Karman 
equations  under  the  rotation  group  which,  for  any  real  a  >  0,  is  defined 
by  the  transformation  (from  X^) 

x  =  x  cos  a  -  y  sin  a;  y  =  x  sin  a  -f  y  cos  a;  w  =  w;  F  =  F. 

It  is  known  [1]  that  the  invariants  of  this  group  are 

n  =  x2  +  y2;  F  =  f(n);  w  =*  h(n).  (4.1) 


Upon  substituting  (4,1)  into  equations  (1.4)-(1.5a) ,  we  find  that  f  and 
h  must  satisfy  the  two  differential  equations 


dn 


,2  d2f,  _  d  r  -dhv 2, 

fr>  7T]  "  WnW  Is 

dri 


(4.2) 


,  d2  .2  d2h,  d  ,  dh  df. 
dn2  dn2  "  dri  dri  dri  ‘ 

Both  of  these  equations  can  be  integrated  once  to  obtain 


(4.3) 


,  d  r  2  d2f,  ,dh.2 

4  dn  [T1  ^2 ]  =  ”n  W 


+  A; 


(4.4) 


d2h1  _  n  dh  df 
dn2’ 


(4.5) 


for  constants  A  and  B.  Two  particular  choices  of  h  lead  to  exact  solu¬ 
tions  of  Ghis  system.  With  h  =  constant,  equation  (4.5)  is  satisfied 
identically  if  we  take  B  =  0  while  (4.4)  reduces  to  a  linear  equation 
which  has  solutions  of  the  form  (singular  at  the  origin) , 


f(Tl)  =  cx  +  c2  In  ti  +  c^ri  +  c4n  In  r»,  (n  ^  0) 

where  the  are  constants.  A  second  approach  is  to  set  h(n)  =  cf  (ri) 
for  a  constant  c.  Equations  (4,4)-(4.5)  then  lead  to 
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(4.6) 


n<£>!  -  »! 


where  the  constant  D  depends  upon  A,  B,  and  c.  For  n  f  0,  (4.6)  integrates 
to  give 

1/2 

f  =  ±2Dr|  '  +  K, 

with  ic  a  constant  of  integration. 

It  is  interesting  to  observe  that  if  we  set  dh/dr)  =  df/dri  in  (4.4) 
or  df/dn  =  -dh/dn  in  (4.5),  we  obtain  in  both  cases  an  equation  of  the 
form 


^  [n2  f* 1  +  *ng2  =  c,  (4.7) 

where  g  -  df/dn  and  X  =  1/4  or  g  =  dh/dn  and  X  =  1/2.  For  c  =  0,  (4.7) 
is  an  equation  of  the  Emden-Fowler  type  about  which  there  is  considerable 
literature  (e.g.,  see  [11]).  Because  analytic  solutions  of  this  class  of 
equations  are  known  only  in  a  few  cases,  the  appearance  of  such  an  equation 
in  the  analysis  is  indicative  of  the  difficulties  one  finds  in  attempting 
to  find  exact  solutions  of  the  von  Karman  equations.  We  note  here  that 
for  c  =  0,  equation  (4.7)  can  be  transformed  into 

^f+rfV-O  (4.8) 

dn 

by  setting  y(n)  =  nXg(n)*  One  established  fact  about  equation  (4.8)  is 
that  all  of  its  solutions  are  oscillatory  on  (0,OT),  by  which  we  mean  that 
for  any  >  0  there  exists  r)2  >  such  that  y(n2)  =  0*  This  type  of 
result  gives  us  information  about  a  subset  of  those  solutions  of  the  von 
Karman  equations  which  are  invariant  under  the  rotation  transformation 
group. 


5,  SPIRAL  GROUP.  As  we  shall  see,  the  von  Karman  equations  also 
remain  invariant  under  a  third  group  of  transformations,  the  spiral  group, 
which  has  the  form  (a  linear  combination  of  X2,  X^,  X^  and  X^  (with 
f^(t)  constant)) 

x  =  x  +  Y-^a,  y  =  y  +  Y2a,  F  =  eaaF,  w  =  e^aw,  (5.1) 

for  real  a  >  0  and  y^j  Y2»  3  constants.  The  invariants  of  this  group 

are 


n  =  y2x  -  Y^y,  f  (n)  -  Y2ln  F  “  ay>  Mn)  =  Y2ln  W  -  6y.  (5.2) 

Substitution  of  (5.1)  into  equations  (1.4)-(l*5a)  leads  to  the  two 
invariance  conditions  a  =  23  and  3  =,ot+3  which  together  imply  that 
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a  -  3  =  0*  Thus,  (5.2)  becomes 


n  =  y2x  -  y^y,  f(n)  =  Y2ln  F>  h(n)  =  Y2ln  w 


(5.3) 


and  the  von  Karman  equations  are  reduced  to  a  fourth  order  orindary  dif¬ 
ferential  equation  which  both  f  and  h  must  satisfy,  namely 


+  4bg 1  g,M  +  6c(gf)2g"  +  3b(g,f)2  +  d(g')4  =  0 


(5.4) 


where 


a  =  y;  +  y;y,  +  2  ^  ;  b  =  Y;  +  2Y:  +  -T  ;  C  -  Y9  +  -T  +  2  rr 


Y1  T1 

d  =  1  +  ^  +  2  ; 


We  observe  that  under  the  action  of  the  spiral  group,  the  original  set  of 
partial  differential  equations  is  not  only  uncoupled,  but  also  transformed 
in  such  a  way  that  the  invariants  f(f|)  and  h(ri)  satisfy  the  same  differen¬ 
tial  equation. 

Consider  equation  (5.4)  with  Y-j_  "  Y2  =  1  and  u(n)  =  g?(n).  We  then 


ufu  +  4uu”  4-  6u2ut  4  3(u!)^  4  u^  =  0. 


(5,5) 


This  equation  is  linearized  by  raising  the  order.  The  transformation 


yT  =  uy 


gives  rise  to  the  sequence 


yn  -  y[u*  4  uZ] 


y1"  -  y[u"  +  3uuf  4  u  ] 
y<iv)  =  y[u'"  +  W  +  6u2u'  +  3(u')2  +  u4]. 
Consequently,  the  truth  of  (5.5)  implies  that 


whereupon  the  solution  for  u  is 


u  =  yVy  = 


3A3n  +  2A2n  +  ax 


A3n3  +  A2f|2  +  Ain  +  A0 


3n  +  2An  +  b 
3  2 

n  +  ati  +  Bn  +  c 


(5.6) 
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Whereupon, 


g(n)  =  in[n3  +  An2  +  Bn  +  c]  +  d.  (5.7) 

Since  both  f  and  h  must  satisfy  (5.4),  it  follows  that  f(ri)  =  h(n)  = 
gCh)  is  an  exact  solution  with  four  arbitrary  constants  A,  B,  C  and  D, 

Hence 


3  2  D 

F  =  w  =  G[rf  +  Ari  +Bri  +  C],  G  -  e  ,  r|  =  x-y 

constitute  a  set  of  exact  solutions  to  the  von  Karman  equations. 

Remark  2.2.  For  the  time  independent  problem  other  linear  combina¬ 
tions  of  X2,  X3,  X4,  X5  (without  3/9 t),  X^,  Xg,  X9,  X]_2>  x13>  x14  (with 
fi(t)  =  constants)  can  be  used  to  generate  solutions.  Some  of  these  may 
be  the  same  as  can  be  ascertained  by  examination  of  the  commutator  table 
and  the  splitting-nonsplitting  analysis.  That  is  not  shown  here, 

6,  THE  TIME  DEPENDENT  EQUATIONS.  Equations  (1.4)  and  (1,5b)  will 
be  studied  here  using  the  dilatation  generator  X5.  The  group  invariants 
are 


c,  =  x//t;  n  =  y//t;  f  =  f(c,n);  w  =  h(C,n). 


(6.2) 


This  transformation  results  in  the  following  partial  differential  equations 
for  the  functions  f  and  h: 


A? -  _  ,32h  .2  A  , 

'  "  3C2  3n2  * 

a2.  _  _  1  ,r2  afh  ,  2  4. 

A  h  '  4  3c2  n  3ti2  2?n 


(6.3) 


,  or  3h  „  9h  . 

+  35  K  +  3n  5 


,32f  a2h  ,  3^f  afh  2  a2f  a2h  . 

an2  a?2  ac2  3n2  "  353n 


(6.4) 


If,  at  this  point,  we  attempt  to  reduce  equations  (6.3)  and  (6.4)  by 
re-applying  the  dilatation  group,  we  discover  that  such  a  reduction  is 
not  possible.  However,  we  can  find  solutions  of  this  system  by  considering 
the  action  of  the  rotation  group 

C  -  Ceos  a  -  nsin  a; 

n  =  Csin  a  +  hcos  a. 

As  indicated  previously,  the  invariants  of  this  transformation  group  are 

(6.5) 


C2  +  n2;  f  =  v(z);  h  =  u(z). 


Substitution  of  these  relations  into  (6,3)  and  (6.4)  leads  to  two  coupled 
ordinary  differential  equations  for  u  and  v,  namely 


296 


(6.6) 


2  2 

d  r  2  d  v1 

— 2  C2  — i 

dz  dzz 


1  d  r  ,du.2, 

-  4  dT  1; 


[z 


2  d; 

dz 


uLl  _  1_  _d_  r  du  dv-, 
2  2  dz  Lz  dz  dzJ 


.  i  [2z  + 

16  1  dz 


2  d/ 
dz 


-1 

2  * 


(6.7) 


One  set  of  solutions  to  this  system  is  u(z)  =  c  and  v(z)  =  a^zln  z  + 
a2ln  z  +  a3z  +  a 4  where  c  and  the  a ^  (i  =  1,..., 4)  are  constants.  It 
thus  follows  that  the  time  dependent  von  Karman  equations  possess  the 
exact  solution 


w  =  c 


If  we  choose  u(z)  =  v(z)  in 
obtain  the  following  .equation  for 


equations  (6.6)  and  (6.7),  then  we 
v(z) 


3  d  ,  ,dv,2,  1  r,  dv  2  d2v, 

Tt  tz(d^  1  “I6  [2z  d^+  2  7T1’ 

dz 


from  which  a  first  integral  can  be  obtained.  It  follows  that 


3  ,dv.  2  1  2  dv 

4  2  (dl>  =  16  2  S  +  Y* 


(6.8) 


where  y  is  a  constant  of  integration.  For  z  ^  0  we  can  solve  for  dv/dz 
in  (6.8)  and  obtain  v(z)  (and  thus  u(z))  after  a  quadrature. 


An  alternative  way  of  dealing  with  equations  (6.3)  and  (6.4)  is  to 
look  for  solutions  of  the  form  f  =  g(z)  and  h  =  p(z)  where  z  =  £-ri. 

Under  such  a  transformation,  equations  (6.3)  and  (6.4)  become  linear 
equations 


d4p  .  1  2  O  3  dp  „ 

-4  +  16z72+T6ZI?“°-  (6 

dz  dz 

The  first  of  these  equations  can  be  readily  integrated  to  give  g(z)  = 
c-|_z  +  c2z  +  c^z  +  c^  for  constants  ci  (i  =  1,...,4).  Information  about 

the  stress  distribution  in  the  middle  surface  of  the  plate  can  then  be 
obtained  since 
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implies  that 


F 


1  6c  (x-y) 

-  t  =  a  =  cr  *  “  [— 4: - +  2cJ, 

xy  x  y  t  Yt  2 

Equation  (6.9)  has  two  classical  solutions  p^  =  sin(z^/8)  and 

P2  =  cos(z2/8)  and  a  third  solution  of  the  form  p3  =  /sin(t-T>t”^2ciT, 
t  -  2^/8.  (These  solutions  are  due  to  our  colleagues  Frank  Stallard 
and  Thomas  Morley.) 

7.0  OTHER  POSSIBILITIES.  The  four  groups  considered  (dilatation, 
rotation,  translation  and  spiral)  are  not  the  only  ones  under  which  the 
von  Karman  equations  remain  invariant.  To  illustrate  another  (among 
the  many)  possibility  consider  X4  +  eX^,  i.e.,  the  invariants  are 
obtained  by  integrating 


[yir~ x  37 +  = 0  (7-1} 

where  £  is  a  constant.  The  invariants  are 

,  2  2,1/2  ,  _  .  -1  x 

r  =  (x  +y  )  and  I  =  sin  —  -  £t. 

Consequently,  solutions  of  the  form 

F  =  g[(x  +  y  )  ,  sm  -  -  Et] 

with  a  corresponding  form  for  w  can  be  sought. 

It  should  also  be  noted  that  these  analyses  yield  invariant  solu¬ 
tions  of  the  differential  equations  alone.  It  may  not  be  possible  to 
match  these  exact  invariant  solutions  to  general  auxiliary  conditions 
such  as  prescribed  boundary  values  or  energy  conservation  integrals. 

However,  these  solutions  are  interesting  since  they  are  often 
asymptotic  limits  of  other  solutions. 


298 


REFERENCES 


1.  W*  F.  Ames,  Nonlinear  Partial  Differential  Equations  in  Engineering, 
Vol.  II,  Chapter  2,  Academic  Press,  New  York,  1972 

2.  G.  W.  Bluman  and  J,  D.  Cole,  Similarity  Methods  for  Differential 
Equations,  Springer,  New  York,  1974. 

3.  L.  V.  Ovsiannikov,  Group  Analysis  of  Differential  Equations  (Russian 
Edition,  NAUKA  1978);  English  translation  edited  by  W.  F.  Ames, 
Academic  Press,  1982. 

4.  R.  E,  Boisvert,  Group  Analysis  of  the  Navier-Stokes  Equations, 

Ph.D.  Dissertation,  Georgia  Institute  of  Technology,  Atlanta,  GA 
30332,  1982. 

5.  R.  E.  Boisvert,  W.  F,  Ames  and  U.  N*  Srivastava,  Group  Properties  and 
New  Solutions  of  Navier-Stokes  Equations,  in  press  Jour.  Engineering 
Mathematics. 

6.  F.  Schwarz,  Personal  communication. 

7*  F.  Schwarz,  A  Reduce  Package  for  Determining  Lie  Symmetries  of 

Ordinary  and  Partial  Differential  Equations,  Comp.  Physics  Commun. 

27,  p.  179-186,  1982. 

8.  P,  Roseneau  and  J.  L.  Schwarzmeier ,  Similarity  Solutions  of  Systems 
of  Partial  Differential  Equations  Using  MACSYMA,  Courant  Inst,  of 
Math.  Sci.  Report  No.  C00-3077-160/MF-94,  1979. 

9.  K.  A.  Ames  and  W.  F.  Ames,  On  Group  Analysis  of  the  von  Karman 
Equations,  Int.  Jl.  Nonlinear  Analysis:  Theory,  Methods  and 
Applications,  6^  845,  1982. 

10.  S,  P.  Timoshenko  and  S.  Woinowsky-Krieger,  Theory  of  Plates  and 
Shells,  McGraw-Hill,  New  York,  1959. 

11.  J.  S.  W,  Wong,  On  the  Generalized  Emden-Fowler  Equations,  SIAM 
Rev.  17,  339,  1975. 


299 


ON  THE  CONTROL  OF  A  LINEAR  STOCHASTIC  SYSTEM  WITH 


FINITE  HORIZON 


<*) 


P,  L.  Chow  and  J.  L.  Menaldi 
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ABSTRACT .  We  consider  a  dynamic  system  whose  state  is  governed  by  a  linear 
stochastic  differential  equation  with  time-dependent  coefficients.  The 
control  acts  additiveiy  on  the  state  of  the  system.  Our  objective  is  to 
minimize  an  integral  cost  which  depends  upon  the  evolution  of  the  state 
and  the  total  variation  of  the  control  process.  It  is  proved  that  the 
optimal  cost  Is  the  unique  solution  of  an  appropriate  free  boundary  problem 
in  a  space-time  domain.  By  using  some  decomposition  arguments,  the  problems 
of  a  two-sided  control,  i.e,  optimal  corrections,  and  the  case  with  con¬ 
straints  on  the  resources,  i.e.  finite  fuel,  can  be  reduced  to  a  simpler 
case  of  only  one-sided  control,  i.e.  a  monotone  follower.  These  results 
are  applied  to  solving  some  examples  by  the  so-called  method  of  similarity 
solutions . 


I.  INTRODUCTION.  The  optimal  control  of  a  stochastic  system  has  a  wide  range 
of  applications,  such  as  optimal  production  scheduling,  inventory  control, 
investment  policy,  as  well  as  the  traditional  problems  in  guidance  and  the 
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trajectory  tracking,  (see,  e.g.  [1]  -  [9]).  The  stochastic  models  are 
commonly  used  in  favor  of  the  determinisitc  ones  because  of  the  ever 
presence  of  noises  and  uncertainty. 

In  this  article,  we  consider  the  evolution  of  a  randomly  perturbed 
system  whose  state,  under  an  additive  control  action,  is  governed  by  an 
Ito's  equation.  The  objective  is  to  minimize  the  expected  total  cost  over 
a  finite  horizon,  with  or  without  the  constraint  on  the  total  resources. 

The  methods  of  analysis  are  based  on  the  dynamic  programming  technique  [10] 
and  the  variational  inequalities  [11].  The  main  results  to  be  presented 
here  are  taken  from  our  recent  joint  paper  [12].  We  show  that  the  optimal 
cost  function  may  be  characterized  as  a  unique  solution  of  a  free-boundary 
problem  for  a  certain  parabolic  equation.  The  optimal  control  is  given  by 
a  reflected  diffusion  from  the  free  boundary.  Our  decomposition  and  re¬ 
duction  theorems  allow  us  to  deal  only  with  the  one-sided  control  without 
resource  constraint.  This  simplifies  the  general  problem  considerably. 

As  a  concrete  example,  a  special  case  is  solved  explicitly  with  the  aid  of 
the  so-called  similarity  solution.  Even  though  the  analysis  is  done  in  one 
dimension,  most  results  may  be  extended  to  higher  dimensions. 

IX.  STATEMENT  QF  THE  PROBLEM.  In  this  paper,  we  wish  to  control  a  linear 
stochastic  differential  equation  in  the  sense  of  Ito  by  using  additive 
strategies,  i.e.  the  evolution  of  the  state  is  governed  by 

dy(s)  =  dv(s-t)  +  (a(s)y(s)  +  b(s))ds  +  a(s)ds (s-t) ,  s  ^  t  , 

a 

y(t)  «  x  +  v(0)  ,  ■ 

2 

where  a(s),  b(s)  and  a  (s)  stand  for  the  drift  and  the  covariance  terms, 
and  x  is  the  initial  state  at  the  time  t  .  (v(s),  s  a  0)  stands  for  the 
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control  which  is  a  progressively  measurable  process  with  locally  bounded 
variation. 

To  each  control  v  in  ,  the  class  of  admissible  controls,  it  is 
associated  with  a  cost  given  by  the  payoff  functional 

T  s 

J  t(v)  -  E£f  f (y (s) ,s)exp(-J  a(X)d\)ds  +  c(t)v(0) 

X  t  t 

T  s  ^ 

+  J1  c(s)exp(-J  or(X)dl)d\j(s-t)  ]  , 

I  t  t 

where  f,ar,c  and  T  are  respectively,  the  running  cost,  the  discount  factor, 
the  instantaneous  cost  per  unit  of  fuel  and  the  finite  horizon. 

Our  purpose  is  to  characterize  the  optimal  cost 

u(x,t)  =  inf[Jxt(v)  :  v  in  V}  (3) 

and  to  construct  an  optimal  control  v  ,  i.e. 

0  in  *)f  such  that  u(x,t)  ■»  Jxt(v)  (4) 

for  each  initial  state  (x,t)  . 

A  similar  study  will  be  made  for  the  optimal  cost 

v(x,z,t)  -  inf[J^t(v)  :  V  in  K,  v(T)  £  z]  ,  (5) 

where  the  positive  constant  z  stands  for  the  total  amount  of  fuel  available. 

This  is  associated  with  the  previous  cases  under  constraint  of  resources. 
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III.  MAIN  RES ULTS .  Consider  the  differential  operators 


Au  =*  a2(t)  -  (a(t)x  +  b (t) )  +  <*(t)u 


and 


BU  *  -  t  ~  ^ 


(6) 

(7) 


A  heuristic  application  of  the  dynamic  programming  to  the  problem  (1),  (2) 
yields  the  following  Hamilton-Jacobi -Be liman  equation 


(Au  -  f)  V  Bu  =  0  in  B  X  [0,T)  , 
u(  ■  ,T)  =  0  in  B  , 


(8) 


where  x  V  y  denotes  the  maximum  of  the  two  real  numbers  x  and  y  ,  Equation 
(8)  may  be  used  to  characterize  the  optimal  cost  u  given  by  (4),  for  which 
the  control  is  one-sided. 

For  the  corresponding  problem  with  finite  resources,  we  have  the  following 
lIamilton-Jacohi -Be liman  equation 


(Av  -  f)  V  B’v  -  0  in  lx  (0,«)  X  [0,T)  , 

v(- ,* ,T)  -  0  in  JR  X  [0,*)  ,  (9) 

Av  -  f  in  lx  [0]  X  [ 0 ,T) 


to  be  satisfied  by  the  optimal  cost  v  given  by  (5)  ,  where 


(10) 


We  assume  that  a(t),  b(t),  a(t) ,  ar(t),  c(t)  are  Lipschitz  functions 
from  [0,T]  into  B  and  either  c(t)  a  cQ  >  0  for  every  t  or  c(t)  =  0 
for  every  t  ,  and  f(x,t)  is  a  smooth,  positive  function  satisfying  certain 
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growth  properties.  Then  the  optimal  cost  u  can  be  shown  to  be  the  unique 
solution  of  the  following  free  boundary  problem 


4-1 

II 

5 

and 

Eu  £ 

0 

if 

X  a  x*(t)  , 

Au  *  f 

and 

Bu  = 

0 

if 

x  s  x*(t)  . 

where  the  free  boundary  x  *  x*(t),  t  €  [0,T]  ,  is  defined  by 

x*(t)  =  inf [x  ~  (x,t)  +  c(t)  >  0}  .  (12) 


In  general  we  have  obtained  the  following  results: 


(Rl)  Decomposition  Theorem.  Suppose  that  u(x,t)  and  v(x,z,t)  are  the 
optimal  costs  for  the  unlimited  and  limited  resources,  respectively,  and 
u°(x,t)  be  the  cost  (2)  without  control  (v  =  0)  .  Then  the  optimal  cost 
v  can  be  decomposed  as  follows: 

v(x,z,t)  =  u(x,t)  +  h(x+z,t)  ,  (13) 

where 

h(x,t)  =  u°(x,t)  -  u(x,t)  .  (14) 


(R2)  Reduction  Theorem.  If  the  running  cost  function  f  is  symmetric  about 
the  trajectory  x  =  xQ  so  that 

f(x,t)  *  f [ 2Xq( t)  -x,t]  +  0(t) 

for  some  function  0  ,  then  the  optimal  cost  u  for  a  two-sided  control  has 
the  property 

u(x,t)  =  u(2xQ(t)  -  x,t)  +|(t),(x,t)  in  Ex  [0,T]  ,  (15) 

with 
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T  -  s 

0(t)  “J  [0(s) +b(s)c(s)  +  xQ(s)c(s)]exp(-J  a(\)d\)ds  .  (16) 

t  t 

Because  of  this  symmetry,  one  may  reduce  the  problem  to  the  case  of  one-sided 
control. 


(R3)  Optimal  Policy.  Let  x  =  x  (t)  be  the  free  boundary  for  a  one-sided 
control.  Then  the  optimal  policy  is  to  keep  the  evolution  y(t)  £  x  (t) 
with  the  minimum  use  of  resources,  that  is 


y(s)  £  x  (s)  for  s  £  t  , 
dv(s-t)  —  0  if  y(s)  >  x  (s) 


V(0)  = 


0  , 


if  x  >  x  (t) 


x  (t)  -  x,  otherwise  . 


(17) 


The  optimal  control  is  determined  by  the  equation  (1). 

In  view  of  the  above  results  (Rl)  -  (R3),  to  construct  an  optimal  policy 
for  the  two-sided  control  with  or  without  the  resource  limitation,  we  only 
need  to  solve  the  problem  of  the  one-sided  control  with  unlimited  resources. 
Therefore  our  results  yield  a  drastic  simplification  in  treating  the  general 
problem. 


XV.  EXAMPLES .  To  illustrate  our  results,  we  shall  consider  some  examples. 

We  assume  that  the  coefficients  ajbjQfjO'  in  (1)  and  (2)  are  constant,  and  the 
running  cost  f(x)  is  time -independent.  In  addition,  let  c(t)  s  0  ,  i.e., 

the  cost  for  control  is  negligible.  For  a  >  0  and  b  <  0  ,  the  equation  (1) 
may  be  interpreted  as  an  inventory  model  for  which  the  rate  of  demand  decreases 
as  the  stock  increases.  On  the  other  hand,  if  a  <  0  and  b  >  0  ,  it  becomes 
the  Lagenvin  equation  describing  the  motion  of  a  Brownian  particle  in  the 


306 


gravi tati.on.3-L  field*  Then  the  problem  pertains  to  the  control  of  an  Ornstein 
Uhlenbeck  process. 


For  unlimited  resources,  the  average  cost  (2)  yields 

T 

J  t(v)  ={Ej  f(y(s))e  asds]  . 
t 

By  our  result  (Rl) ,  the  optimal  cost  u  must  satisfy 


A0a  =  f 

and 

dx 

0 

if 

* 

x  a  x  (t)  , 

A0u  S  f 

and 

c)u 

dx  ' 

0 

if 

x  £  x*(t)  ,  0  *  t  s  T  . 

where 


V 


du 

dt 


1  2 

"  2  ° 


-  (ax  +  b)  |^  +  au 


x  (t)  =  inf[x  :  |“(x,t)  >  0] 


(18) 


(19) 


(20) 


(21) 


To  construct  the  solution  u  for  x  a  x  (t)  ,  we  let  s  =  (T-t)  so  that 
(11)  gives  the  following  free-boundary  problem 


v(x,s)  =  u(x,T-s)  , 


_  dv  12  32v 

Os  ~  2  °  ,2 

?)x 


(ax+b)  |^+arv  =  f(x) 


and 


by 

dx 


S: 


0 


for 


* 

X  >  X 


(T-s), 


0  £  s  *  T  , 


v(x,0)  =  0 


dv 

dx 


x 


x*(T-s) 


(22) 


where  v(x,s)  -  u(x,T-s)  . 
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By  a  proper  change  of  variables,  the  system  (22)  may  be  reduced  to  a 
free-boundary  problem  for  a  heat  equation.  Then,  by  applying  the  method  of 
similarity  solution,  the  free-boundary  is  found  to  be 


* 

x  (t) 


6 

2 


-2a(T-t) 
[  a 


1/2 


b 

a  ■ 


0  £  t  ^  T  , 


(23) 


where  the  parameter  6  is  determined  by  the  equation 

..  „>i«,  ..  ,,-2  -X/2(X2)  -  62) ,, 

46cp  (5)  [  [cp,  (X)  ]  e  d\ 

62  +  1  =  1  Jti  1 - j - 

«P1(6)£p[(6)j'“[cpia)]'V1/2X  a  - 1 


(24) 


This  may  be  solved  numerically  to  yield  6  =  -0.6388..* 

Now,  suppose  the  resource  v  for  control  is  finite  so  that  0  £  v(T)  s  z  • 
The  optimal  cost  v(x,z,t)  can  be  decomposed,  according  to  (Rl)  into  two  simple 
problems.  That  is, 

v(x,z,t)  -  u^(x+z,t)  -  [u(x+z,t)  -  u(x, t)  ]  (25) 

where  u(x,t)  is  the  optimal  cost  without  resource  constraint,  while  u°(x,t) 
is  the  cost  of  free  evolution.  Therefore  it  must  satisfy 

AQu°  “f,  0  s  t  <  T  ,  x  €  1R  , 

u°(x,T)  =  0  ,  (26) 

u°(x,t)  =  0(|  f  (x)  | )  as  |x|  -*  =  , 

where  AQ  is  defined  by  (20).  The  equation  (26)  may  be  solved  to  give 
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Note  that  the  free  boundary,  given  by  (23),  remains  unchanged.  In 
particular,  for  m  “  2  ,  this  problem  may  be  solved  explicitly. 

We  wish  to  point  out  that,  for  the  optimal  correction  problems,  the 
case  of  vanishing  cost,  c  =  0  ,  is  less  interesting.  In  this  case  the  optimal 
policy  would  be  to  counteract  the  noise  as  long  as  the  resources  remain 
available  so  that  f(y(t),t)  is  kept  to  the  minimum. 
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ABSTRACT 

We  study  the  Cauchy  problem 

f  u  =  <|>(u  )  ,  <x,t)  e  R  x  r+  , 

i  u  XX 

1  »<•,<>>  -  f 

with  the  piecewise  linear  constitutive  function  $(£)  "  £+  =  max (0,5)  and 
with  smooth  initial  data  f  which  satisfy  xf 1 (x)  >0,  x  6  R,  and 
f " (0 )  >  0.  We  prove  that  the  free  boundary  s,  given  by  ux(s(t)+,t)  =  0, 
is  of  the  form 

s(t)  =  -ic/t  +  0(/t>,  t  *  0+  , 

where  the  constant  <  =  0.9037...  is  the  (numerical)  solution  of  a  particular 
nonlinear  equation.  Moreover,  we  show  that  for  any  a  e  (0,1/2), 

2 

|^“T  f(s(t))|  =  0(ta  S,  t  +  0+  . 
dt 

The  proof  involves  the  analysis  of  a  nonlinear  singular  integral  equation. 
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A  NONLINEAR  INTEGRAL  EQUATION  OCCURRING  IN  A  SINGULAR  FREE  BOUNDARY  PROBLEM 

12  1 
Klaus  Hollig  *  and  John  A-  Nohel 


1.  Introduction  and  Result. 

(1) 


We  study  the  Cauchy  problem 
u.  =  <J>(u  )  ,  <x,t)  e  R  x  R 

t  XX 


[  u<*,0)  =  f 

with  the  piecewise  linear  constitutive  function  :  R  ♦  R+  given  by 
<MS)  =  £+  =  max(£,0);  the  initial  data  f  :  R  *►  R  are  assumed  smooth, 
f  e  C^(R)  with  bounded  derivatives,  and  satisfy  the  conditions 


specifically 


(2) 


xf * (x)  >  o,  x  e  r  , 
f*<0)  >  0  . 


One  motivation  for  the  study  of  the  Cauchy  problem  (1),  (2)  is  its  similarity  with  the 
well-known  one  phase  Stefan  problem  (in  one  space  dimension)  [3, 4, 7,8]  in  which  one  would 
assume  f'(x)  =  -1  for  x  <  0,  as  well  as  f*(x)  >  0  for  x  >  0,  so  that  f*  has  a 
jump  discontinuity  at  x  *  0.  The  assumption  (2)  yields  a  different  behavior  of  the 
solution  u  and  of  the  resulting  free  boundary.  Indeed,  here  (c.f.  the  Theorem  below), 
the  free  boundary  s,  given  by  \ix(s(t)  +  ,t)  =0,  is  of  the  form 
(3)  s(t)  -  -«/t  +  0(tV2+°),  t  0+  , 

where  tc  is  a  positive  constant  and  0  <  a  <  1/2.  Thus,  the  function  s  is  not 
(infinitely)  differentiable  at  t  =  0,  contrary  to  the  situation  for  the  Stefan  problem 


[7]. 

The  result  (3)  is  established  by  solving  a  nonlinear  integral  equation  ((15)  below) 
with  kernels  which  depend  on  the  unknown  function  s  and  which  are  also  singular  in  the 
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One 


sense  that  the  integral  on  (0,t)  of  the  kernel  does  not  approach  zero  as  t  +  0+. 

T  p 

consequence  of  this  is  that  the  integral  operator  defined  by  (15)  is  not  compact  in  a 
suitable  Holder  class* 

The  principal  motivation  for  the  study  of  the  Cauchy  problem  (1),  (2)  is  that  it 
serves  as  a  prototype  of  nonlinear  parabolic  problems  which  arise  as  monotone  "convex idea¬ 
tions"  of  nonlinear  diffusion  equations  with  nonmonotone  constitutive  functions  $  (see 

[5]  and  [6]);  in  [6,  section  4]  the  reader  will  also  find  the  formulation  and  preliminary 
analysis  of  such  a  convexified  problem,  corresponding  to  a  piecewise  linear  nonmonotone 
( specif  icallly,  <t>'((-®,a)  U  (b,®))  >  0,  <f>*(a,b)  <0,  0  <  a  <  b  <  ®) .  The  analysis  in  [5] 
shows  the  existence  of  infinitely  many  solutions  u  of  the  nonmonotone  problem,  each 
having  ux  bounded,  and  ux  omitting  the  values  in  [a,b];  thus  each  solution  u 
exhibits  phase  changes.  Numerical  experiments  further  suggest  the  conjecture  that  the 
"physically  correct"  solution  of  the  nonmonotone  problem  is  the  one  which,  as  t  +  m, 
approaches  the  unique  solution  of  the  appropriately  related  convexified  monotone  problem. 
However,  for  small  t  >  0  the  behavior  of  the  solution  of  (1),  (2)  is  qualitatively 
different  (see  (3)).  The  present  study  of  (1),  (2)  is  intended  as  a  step  towards  the 
understanding  of  this  intriguing  phenomenon.  The  relation  of  the  convexified  problem  in 

[6]  to  the  Cauchy  problem  (1),  (2)  is  clear  (the  particular  boundary  conditions  in  [6]  do 
not  play  a  role  in  the  analysis  of  the  free  boundary  curve). 

It  is  simple  to  give  a  formal  explanation  for  (3).  We  rewrite  (1),  (2)  as  the  free 
boundary  problem 


- 


uxx' 


s(t)  <  X  < 


t  e  r+ 


(4a)  *{  ux(s(t),t)  =  0 

U(*,0)  =  f  . 

From  the  constitutive  function  $  one  also  has  the  equation 


(4b) 


ut  -  °/ 
u( •  ,0)  -  f  . 


<  x  <  s(t),  t  €  R+ 


Therefore,  assuming  the  continuity  of  u  across  the  free  boundary  s(t)  and  assuming 
that  s  is  monotone  decreasing  (c.f.  paragraph  preceding  the  Theorem),  we  have 
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■<> 


u(s(t),t)  »  f (a(t) ) ,  t  €  R+  , 

(5) 

s(0 )  =  0  . 

Differentiating  (5)  with  respect  to  t  and  using  ux(s(t)+#t)  =0#  where  denotes 

the  limit  from  the  right,  we  obtain 

(6)  £'(s(t))s'(t)  =  uxx(s(t)+,t)  . 

Since  by  the  assumption  (2) 

f'(x)  =  f" (0 )x  +  0(|x|2),  |xl  ♦.  0+  , 

a  simple  calculation  formally  yields  (3)  with  <  =  /2  (provided  one  assumes  continuity 
from  the  right  of  ut  and  u^  up  to  the  free  boundary  s) . 

The  rigorous  treatment  of  the  problem  consists  of  analyzing  in  Section  3  the  nonlinear 
integral  equation  (15)  for  the  free  boundary  x  =  s(t).  Our  analysis  shows  that  (3)  holds , 
but  that  the  constant  <  is  the  solution  of  the  nonlinear  equation  (16);  its  numerical 
value  is  <  -  0*9037.,,,  and  not  k  =  /2  which  was  predicted  by  the  above  formal 
calculation.  It  also  follows  that  s(t)  is  smooth  for  t  >  0  thus  justifying  (5)  and  (6) 
for  positive  t;  in  particular  one  sees  from  (6)  that  s  is  as  smooth  as  the  initial 
function  f  is.  We  remark  that  for  t  >  e  >  0  the  problem  (1),  (2)  can  also  be  viewed  as 
a  one  phase  Stefan  problem;  consequently  the  results  in  Kinderlehrer  and  Nirenberg  [7] 
yield  the  regularity  of  the  free  boundary  for  t  >  0. 

The  existence  of  a  unique  generalized  continuous  solution  for  problem  ( 1 ) ,  and  hence 
of  a  unique  free  boundary,  follows  from  nonlinear  semigroup  theory  for  mraccretive 
operators  [1,2],  Approximating  (1)  by  the  implicit  Euler  scheme  one  can  also  show  the 
existence  of  the  free  boundary  s  which  is  Holder  continuous  on  -  [0,®>  with  exponent 
1/2  and  monotone  decreasing.  However,  using  such  general  methods,  it  is  not  possible  to 
analyze  the  precise  behavior  of  s  at  t  =  0. 

Our  main  result  is: 

THEOREM .  Define 

(7)  ,  r(t)  =  f(s(t)>  . 

Then  for  any  a  e  (0,1/2)  there  exists  T  >  0  such  that  r  is  continuous  on  [0,T]  and 
satisfies 
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(8)  t1_°|rMt)|  <  c.<£),  0<t<T, 

where  c(f)  >  0  is  a  constant  which  depends  on  the  data  f.  Moreover/  (3)  holds  with 

1/2 

*  !=  (2  fkojl  *  0.9037...  . 

The  constant  k  is  the  (numerical)  solution  of  equation  (16)  in  Section  3. 

By  the  definition  of  K,  the  result  (3)  follows  from  (7)  and  the  assertion  (8),  To 

t 

see  this,  we  solve  (7)  for  s*  Let  R(t)  /  r(x)dx  and  integrate  (7)  obtaining 

0 

R(t)  *  f ( s ( t ) )  -  f (0 )  . 

Define  the  function  g  implicitly  by 

g(-sign(x)  /f(x)  -  f(0)4)  =  x  . 

Since  we  assume  that 

(9)  f(x)  -  f(0)  -  32x2  +  0( |x| 3 ) ,  1x1  +  0+  , 

(B2  =  f"(0)/2),  g  is  well  defined  for  small  lx|  and 

(10)  g(x)  =  -B^x  +  0(  1  xj 2 )  /  | x  1  +  0+  .  ’ 

For  a  small  interval  [0,T],  the  monotone  decreasing  solution  of  (7)  is  given  by 

(11)  s(t)  =  g(/R(t7),  0  <  t  <  T  , 

and  (3)  follows  from  (8)  and  (10). 

The  Theorem  describes  the  regularity  of  the  free  boundary  at  t  =  0.  It  is  sharp  in 
the  sense  that,  unless  ff,,(0)  =  0,  the  estimate  (8)  does  not  hold  for  cx  >  1/2  (c.f.  the 
Remark  at  the  end  of  the  paper  in  Section  3). 

It  should  also  be  observed  that  the  second  derivatives  of  the  solution  u  are  not 
continuous  at  the  point  (x,t)  ®  (0,0),  because  using  (6),  (7)  and  the  definition  of  < 
one  has 

2 

lim  uxx(s(t)+Jt)  =  r(0)  =  f"(0>  *  lim  =  f"(0)  . 

+  + 
t+0  x+0 

However,  on  the  set  {t  :  f'(s(t))  <  0}  the  free  boundary  s  is  as  smooth  as  the 
function  f.  This  can  be  shown  by  a  bootstrap  argument,  using  standard  regularity  results 
for  the  heat  equation  on  a  domain  with  curved  boundaries.  We  believe  that  the  Theorem  can 
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be  extended  to  a  general  monotone  constitutive  function  ^  with  4>'(*)  discontinuous  at 
0  and  with  ♦' (C)  >  c  >  0,  5  6  R+?  the  corresponding  value  of  <  will  depend  on 
$’<0+). 

The  Theorem  is  proved  in  Section  3  by  solving  an  integral  equation  for  the  function 
r  derived  in  Section  2. 

We  are  grateful  for  helpful  discussions  with  our  colleagues  Tom  Beale,  Carl  de  Boor, 
Michael  Crandall  and  Emmanuel  DiBenedetto;  we  also  thank  Fred  Sauer  for  the  numerical 
computations  * 


2 •  The  Integral  Equation  for  the  Free  Boundary *  Let 


■(*,t>  :=  -1=  t“1/2exP(-  £) 

2  A 


denote  the  fundamental  solution  of  the  heat  equation.  Let  v  :=»  ux  be  the  solution  of  the 
problem 


(4a* ) 

and  assume  that 
identity 


vt  -  vxx,  (x,t)  e  {(x,t)  :  x  >  s(t),  t  e  (0,T)}  , 

*  v(  s{  t) ,  t)  =  0 

v{ • ,0 )  =  f 

the  free  boundary  s  satisfies  s  e  C[0,T]  n  C1(0,T].  Integrating  Green’s 


3__ 

35 


(T(x  -  5,t  -  t)v^(5,t) 


T)v(C,T) ) 


over  the  domain  8 

t 

(12)  v(x,t) 


(13)  Vx't> 


-  (r(x  -  £,t  -  T )v(C,T) )  -  0 
we  obtain,  for  x  >.s(t),  the  representations 

«  t 

/  r(x  -  5,t)fMC)d5  -  /  r(x  -  s(T),t  -  T)V  (S(T),T)dT  , 

0  0  4 

OB  t 

/  T(x  -  5,t)f"(5)d£  -  /  r  (x  -  s(T),t  -  T )v_ (s(T ) ,T)dT  . 
0  o  x  * 
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Passing  to  the  limit  x  +  s(t)+  in  (13)  yields 


*  t 

(14)  r(t)  =2  /  T(s(t)  -  C,t>frt(S)d5  -  2  /  T  (s(t)  -  s  ( x ) ,  t  -  x)r(x)dx  , 

0  0  X 

where  (see  (6)  and  (7))  r(t)  =  ~  f(s(t))  =  v„ (s(t),t).  The  justification  for  this 

at  x 

passage  to  the  limit  is  contained  in  the  following  result. 

LEMMA  1*  If  s  e  C( [0,T] )  n  c1 ( (0,T] )  and  r  e  C( [0,T] ),  we  have  for  t  <  T 


lim  /  [T  (s(t)  -  s(T),t  -  x)  -  I*  (x  -  s(T)#t  -  x )  ]  r  ( x )  dx  ■  —  r(t)  . 
x  \  s(t)  0  x 


Proof.  We  write 


/  ( •  *  •  ]  r  dx 

0 


r*  s(t) 

-  s(x)  r  i 

r  (x  -  s(x))2x 

J  3  /2  l  exP< 

0  (t  -  x)J/-: 

L"  4(t  -  x)  J 

X  -  s(t)  I 

r  f  (x  -  s(xl) 

4/ir  0 

(t-T)3'2  ' 

Lexpl  4(t  -  x ) 

x  -  s(  t) 

/  (x  -  s( t) ) 2 

4/7  0 

(«  -  X)3/2 

4(t  -  x) 


3  t 


v=1  0 


In  view  of  the  assumptions  on  s  and  r  it  is  easy  to  see  that,  for  v  -  1,2, 


t  t  t-6  _ 

1/  I  |  <  |/  I  I  +  |/  I  |  <  0(/6)  +  c.o(|x  -  s(t)|) 

0  t-«  0 


which  implies  that 


li»  /  I  -  0. 

X  \  S(t)  0 


v  =  1,2 
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t  2  ^ 

Finally,  —  /  - 3/2  4(tY-~rT'dT  =  2  iBpUes  that 


4  A  -<*  (t  “  T) 


r  1 

lim  J  I3  =  j  r(t)  * 


x  \  s{  t)  0 


3.  Proof  of  the  Theorem.  We  write  the  integral  equation  (14)  in  the  form 


(IS)  r(t)  -  -r  /  exp(-  \  -  e)2)f(«/t>« 

/tT  o  A 


+  1_  f  exp(_A(S(t/T)2)r(tT)dT 

/it  1  ‘  T 


=:  (Fr) (t)  +  (Kr) (t)  , 


where 


,  v  s ( t)  -  s(tt) 

A(s,t,T)  :*  ryz  * 

2(t  -  tT)  /2 


It  will  be  convenient  to  introduce  the  class  of  functions  H  [0,T],  0  <a<  1, 
defined  by 

/fa[0,T]  -  {p  :  [0,  T]  >  R  :  I P I  I-  sup  f1  a|p'(t)|  <  *}  * 

°  0<t<T 


The  class  ffa  is  obviously  contained  in  the  Holder— class  with  exponent  «• 

The  Theorem  is  a  consequence  of  s 

PROPOSITION*  For  any  a  6  (0,1/2),  the  integral  equation  (15),  with  s  related  to  r 
(11),  has  a  solution  r  €  tfa[0,T]  for  some  T  >  0.  The  constant  <  :=  /r(O)/0 
(g2  =  j  f"(0))  does  not  depend  on  f  and  is  implicitly  determined  by  the  equation 


(16) 


~  !  exp(- (T  +  - 

/ir  0 


A  0 


K  _ 1 _ 

2  /I  “  T(1  +  A) 


exp(- 


1  -  A>  _  ^ 

- z)dTJ 

1  +  A 


t 


the  numerical  value  of  <  ^is_  0*9037...  . 

REMARK.  The  Proposition  does  not  assert  uniqueness  of  the  function  r  (hence  of  the  free 
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boundary  s)  which  could  be  established  by  showing  that  the  operator  F  +  K  in  (15)  is  a 
strict  contraction;  this  is  technically  even  more  complicated  than  our  proof.  However,  the 
uniqueness  of  r  is  a  consequence  of  the  uniqueness  of  solutions  of  the  original  problem 
(1)  discussed  in  the  Introduction. 

We  prove  the  Proposition  by  iterating  the  integral  equation  (15)  in  the  form 

(17)  rn+1  =  Frn  +  Krn,  n  e  H  , 

2  2  2  1 
with  r(0)  =  r0  =  ic3,  where  <  is  the  solution  of  (16)  and  3  =  —  f"(0). 

We  shall  show  as  a  consequence  of  Lemmas  2  and  3  below  that,  for  r  e  H**  with 

r (0  )  =  kV, 


(18) 


(19) 


w 

lim  (Fr)  ( t)  =  ~  I  «xp<-  7  <K  +  S)2)2B2d?  , 


t+0 


0 


lim  (Kr)(t)  =  -  “  /  “  1 - 3-  exp(-  j—  - - ^ic^^dT  . 


t+0 


/ir  0  /1-t(1+/t) 


1  +  A 


2  2 

Since  <  is  the  solution  of  (16),  this  implies  that  rn(0)  -  K  5  for  n  e  H. 

a 

Moreover,  we  shall  establish  the  a  priori  estimates:  for  r  e  H  [0,T],  0  <  a  <  1/2, 
(20)  |Fr|a  <  c(T)  +  (c^a)  +  c(T))|r|a  , 


where  c„(a)  =  _  exp(-  7  <2),  and 


/ir(1  +  a) 


(21) 


jKrl  <  (c  (a)  +  c(T) ) | r |  , 

a  2  a 


where  c^fa)  -  c21<a)  +  c22^a^  with 


1 


-  ~  /  — - - -  exp(“  1  ~  l?l)dT  * 

z  2/w  0  /I  -  T(1  +  /t)  1  +  /t 


k(1  + 


1 


:)  1 


2  +  2a7  r  1  -  T 

C22  a  - -  f 


1/2 +a 


t .  /I  * 

(1  -  K  - ~J 

1  +  /t 


/it  (2  +  4a)  0  (1  -  l) 


3/2 


r  < /I  “  T>  . 

eXPl"  A - lJdT  r 

4  1  +  A 


and  where  c(T)  is  a  constant  such  that  c(T)  ■*  0  as  T  ♦  0  ,  uniformly  for 
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r  e  {p  i  I P ( 0  > I  +  |p|  <  const.} • 

ct 

We  first  use  the  estimates  (20),  (21)  to  complete  the  proof  of  the  Proposition. 
Combining  the  estimates  (20)  and  (21)  one  has 


(22)  lr  |  <  c(T)  +  (c.,(cl)  +  c  (a)  +  c(T))|r  |  . 

iu  i  *  n  a 

Crucial  for  the  following  argument  is  the  fact  that 

C1(2)  *  C2(2)  =  °*339-**  +  0.453...  to  <  1  . 

—  1  +  to 

Set  to  ;=  — - —  <  1  and  choose  a  e  (0,1/2)  close  to  1/2  and  T  >  0  such  that  for 
all  r  e  Ha  with  r(0)  =  »c202  and  |rl  <  — - — 

°  1  -  s 


c^(a)  +  c2(a)  +  c(T)  <  to  . 

It  should  be  observed  that  if  one  chooses  a  >  1/2  then  we  cannot  prove  the  crucial 
estimate  (20),  cf.  e.g.  (24).  By  (22),  we  have 

I r  I  <  — n  e  N  . 

n  a  i-5 

Hence  we  can  select  a  subsequence  of  rR  which  converges  in  C[0,T]  to  a  function 

r  e  H  [0,T]  with  r  (0)  «  <  $  .  Set  s-  g(/R  ).  To  pass  to  the  limit  in  (17)  note 

*  “  1  sn(t)  2 
that  by  Lemmas  2  and  3  below  the  expressions  exp(-  j  3 - -  C)  )  and 

A(s  ,t,T)  /t 

— ^ —  exp(-A(sn,t,T)42)  converge  pointwise  (for  n  +  *)  and  are  majorized  by 
integrable  functions,  uniformly  in  n  e  H.  This  completes  the  proof  of  the  Proposition  and 
of  the  Theorem. 

It  remains  to  establish  the  assertions  (18)-(21).  We  require  two  auxiliary  results. 

We  denote  by  c  a  generic  constant  which  may  depend  on  a,  JrJ^  and  T,  and  we  assume 

throughout  that  T  =  T(|rla,a)  is  sufficiently  small. 

a  2  2 

lemma  2.  For  r  e  H  ,  a  e  (0,1/2),  with  r(0)  -  <  0  we  have 

Js(t)  +  xi^t|  <  ct1/f2+a  . 

a  i+a 

Proof.  Note  that  |r(t)  -  r(0)|  <  ct  and  therefore  |R(t)  -  r(0)t|  <  ct  .  Using  (10;, 
(11)  and  this  inequality  one  has 

|s(t)  +  <^tl  -  |g(^R(t))  +  c/t)  <  |-0  Vft(t)  +  k/ t 1  +  ct  <  ct1/2+a  +  ct  . 
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ot  2  2 

LEMMA  3.  For  r  e  H  with  r(0)  -  k  $  we  have 


|A(s,t,T>  |  <  c  - - -  -  c 

/l  -  T 


Proof.  Using  f'(s)sl  -  r,  (9)  and  Lemma  2,  we  obtain 


|s(t)  -  s<tx)|  =  if  - ^  da|  <  c  /  (202k/<j  -  ca1/2+a)*1dc  <  c(/t  -  /tx)  ? 


/I  -  t 
1  +  /r  ’ 


this  establishes  the  claim  by  the  definition  of  A(s,t, x). 

Lemma  3  shows  that  the  kernel  corresponding  to  the  operator  K  in  (15)  is 
integrable.  Moreover,  we  see  from  Lemma  2  that 

(23)  A„(T)  :=  lim  A(s,t,T>  =  -  |  -  ~  /x  . 

Using  this  and  Lemma  2,  we  can  pass  to  the  limit  in  (15),  thus  establishing  (18)  and  (19), 
Proof  of  (20).  To  estimate  the  norm  of  Fr,  use  the  definition  in  (15)  to  form 


*(F^(t)  =  ~  I  exp(-  ~  -  5)  )  \  t"1/25f"(5/t)d5 


/*  0  4  /t  2 


-  [±  !  J  -  e)«p(-  J  -  e)  )f(5/t)dc]  _ 

✓it  0  /t  /t  /t 


1  ( s(t) 


d  s(  t) 


As  t  \  0,  the  term  in  square  brackets  tends  (use  (9))  to 


I  i  <■*  ■  5>8xp<-  7  <-e  -  !>2)2B2dS 


✓it  0 


Therefore, 

(24) 


*  -  62exp(-  j  k2)  «  2(1  +  eOie^c  (a) 

✓iT 


|d(FT)(t) |  <  ct-1/2  +  (2(1  +  a)K@2c  (a)  +  -(t))||_ 

dt  1  dt  j- 
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It  remains  to  estimate  Using  (7),  Lemma  2,  (9)  and  (10)  we  have 

dt  a 

-  2  ^/I1  =  t'3/2|F(I(T)Tl|tr<t)  "  2  s(t)f'(s(t))l 

<  t'3/2t'1/2(|  e-2*:'1  +  o(t) )  [ |  tr(t)  -  S2s(t)2|  +  Ct3/21 


<  t-1(^  S-2tt_1  +  5(t>>[|tr(t)  -  R(t)|  +  ct3/2l  . 


A  simple  calculation  shows  that 

(25) 

and  this  yields 

(26) 


|tr(t)  -  R(t)J  <  -  -  t1+aJr|a  , 


ifj  I  <  ta  $  2<  1  +  c(t))(1  +  c(t))|rla  . 


Combining  (24)  and  (26)  proves  (20). 

We  next  turn  to  the  proof  of  (21 ) .  We  write  (cf*  (15)) 


~T  (Kr)  (t)  =  —  /  ~  1  "  A  exp(-A2)Tr'  (tx)dx 

dt  0  1  "  T 


+  —  /  ,  1  ■  (1  -  2A2)exp(-A2)  77  r(tT)dT  =:  (K.r) (t)  +  (K.r) (t) 

/?  0  1  "  T  dt  1 


and  estimate  each  term  separately. 

(i)  Since  |r' (tt)|  <  (tx)a  1 1 r | a  ,  it  follows  from  (23)  that 

(27)  l(X1r>(t)|  <  (c21  +  c(t))t“-1|r|a  . 

d 

(ii)  To  estimate  K2r  we  first  consider  the  term  —  A(s,t,x). 
A  and  ( 7 ) ,  we  obtain 

2(t  -  tT)1/Zt  ~  (•;  -s-(t>  "  =  ts’(t)  -  (tT)s'(tT)  - 

dt  2  (t  -  tt)1/z 


Using  the  definition  of 

*  s(t)  +  ■“  s(tx)  = 


/ 


tx 


d_ 

da 


(os' (a) 


1_ 

2 


s  ( a ) )  da 


+  a 


SL 

da 


(  £121 _ 

'■f,(s(a>) 


■))da 


i*e. 
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(20) 


dA 

dt 


1  -1 


t  '  ( t  -  tx)  1/2  /  (Q1  (o)  +  q  (a))do 


ti 


with 


Q,(o)  i=  a  — (g| 

1  f ' (s(  a) ) 


Q2(a)  :=  s'  (a) -  a  ^(-g>f"(3(a)  >)  . 

f '  (s(a) )2 


We  estimate  each  term  separately.  By  Lemma  2  and  (9),  we  have 


(29) 


x  t  aa~* |r | 

/  12,(0)130  <  /  o— — - f~-  d0  < 

tT  tx  2B2r/o  -  co1/2+“ 


^2  1/2  h  a  9  2|e  1  +  c(t))t1/2+“(1  -  T1/2+a)|r|  . 


We  write  Q2  in  the  form 


°2(a)  "  (f*  (g(/R(a)  >  ))2  -  ar (a)f "(s (a) ) )  , 


Since  by  (9),  (10)  and  Lemma  2, 

(f'(g(/R(o))>)2  -  202r(c)|  ] 


|ar(a)f”(s(a))  -  202cr(a) | 


<  ca 


3/2 


we  obtain,  using  also  (25), 


(30) 


t  t 

/  lQ2(o)|da  <  /  - ■  (i  + 

tT  tT  (20  k/o)3 


0(0) )  (2B2  -j-J-j  a1+a| r  |  ) da  < 


(4  TT  +  a)(1/2  +  a)  8  2k-1  +  o<t))tV2+“(1  -  x1/2+a)|r|a  . 


Combining  (29) 
(31) 


and  (30)  with  (28),  it  follows  that 

i  (Kjr)  (t) 1  <  ^  rV\i  +  ~  —L)(i  +  c(t))t0_1 


1  /  1  -  x1/2+“ 

*  /f  I  77T  t)3/2  Hi  +  c( t) )r (0 )dT  . 
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Adding  the  estimates  (27)  and  (31)  proves  (21).  . 

Remark .  We  conjecture  that,  for  smooth  initial  data  f,  the  function  r(t2)  is  smooth, 

i.e . 

(32)  r(t)  =  k2S2  +  r1/2^t  +  rit  +  •**  • 

Assuming  an  expansion  of  the  form  (32),  we  can  calculate  the  coefficients  ri/2,r1'’** 

from  the  integral  equation  (15).  In  particular  f*"(0)  f  0  implies  that  r-j ^  ?  °*  This 

shows  that  (8)  is,  in  general,  not  valid  for  a  >  1/2. 
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NONLINEAR  INVERSE  HEAT  TRANSFER  CALCULATIONS  IN  GUN  BARRELS* 


Alfred  S.  Carasso 
Center  for  Applied  Mathematics 
National  Bureau  of  Standards 
Washington,  DC  20234 


ABSTRACT 

We  consider  the  problem  of  determining  the  temperature  history  inside  a  gun 
barrel  from  embedded  thermocouple  measurements  at  some  distance  away  from  the 
inside  wall.  This  inverse  problem  leads  to  an  improperly  posed  initial  value 
problem  for  a  nonlinear  system  of  partial  differential  equations,  whenever  the 
thermal  properties  are  temperature  dependent.  We  discuss  a  step-by-step 
marching  algorithm  for  the  numerical  computation  of  such  problems.  The  scheme 
is  stabilized  by  appropriately  filtering  in  the  frequency  domain  at  each  step. 
We  illustrate  this  technique  with  a  numerical  experiment  on  a  nonlinear 
problem  whose  exact  solution  is  known.  The  basic  ideas  are  applicable  to 
other  unstable  evolution  equations. 


I.  Introduction 

This  report  summarizes  the  results  of  an  important  computational 
experiment  on  a  nonlinear  inverse  heat  conduction  problem  whose 
exact  solution  is  known.  We  consider  the  prolem  of  determining 
the  temperature  history  at  the  inside  wall  ot  a  gun  barrel ,  from 
embedded  thermocouple  measurements  at  various  points  in  the  annular 
metallic  region  between  the  inner  and  outer  radii  of  the  cannon. 

As  the  shell  is  fired,  a  continuous  trace  is  recorded  at  each 
thermocouple,  providing  temperature  as  a  function  of  time  at  the 
correspondi ng  fixed  spatial  location. 


*  Research  sponsored  by  the  U.S.  Army  Research  Office 
under  MIPR  No.  ARO  63-82 
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The  present  study  centers  around  a  novel  computational  technique 
designed  especially  for  coping  with  the  nonlinear  case  of  tempera¬ 
ture  dependent  thermal  properties.  It  is  a  sequel  to  [1]  where  the 
linear  quarter  plane  problem  with  constant  coefficients,  was  tho¬ 
roughly  analyzed.  As  was  shown  there,  in  that  case,  the  inverse 
problem  can  be  formulated  either  as  a  Volterra  integral  equation 
of  the  first  kind,  or  equivalently,  as  an  ‘initial  value  problem 
for  the  one  dimensional  heat  equation  run  sideways.  Either  formu¬ 
lation  leads  to  an  improperly  posed  problem  in  which  the  solution, 
when  it  exists,  depends  discontinuously  on  the  data. 

The  inverse  problem  can  be  regularized  in  the  lz  norm  by  placing  an 

a-priori  bound  M  on  the  norm  of  the  unknown  temperature  history, 

-f(t),  at  the  inside  wall  x  =  0;  at  the  same  time,  the  measured  noisy 

temperature  data  g  (t),  at  the  location  x  =  i  >  0,  is  regarded  as 
m 

differing  by  at  most  e  in  the  L  norm  from  unknown  smooth  exact  data 
g(t),  for  which  a  solution  exists.  It  is  assumed  that  e  and  M  are 
known  and  compatible.  As  shown  in  [1,  equations  (2.20),  (2.21)]  this 
leads  to  explicit  formulae  for  the  temperature  and  gradient  histories 
at  each  fixed  x,  0  <  x  <  £.  Also,  error  estimates  are  obtained  for 
the  regularized  solutions  implying  Holder  continuity  with  respect  to 
the  data,  for  each  fixed  positive  x.  These  estimates  degenerate  at 
the  wall,  [1,  Theorem  1]. 

The  regularization  procedure  can  be  interpreted  as  solving  the 
initial  value  problem  for  the  sideways  heat  equation  with  appro¬ 
priately  modified  initial  data.  An  explicit  finite  difference 
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scheme  consistent  with  that  problem  is  shown  to  be  unconditionally 
convergent,  when  used  with  the  filtered  initial  data,  [1,  Theorem  3] 
This  step-by-step  marching  scheme  in  the  x-variable  is  the  basis  for 
our  approach  to  the  nonlinear  case  of  a  temperature-dependent  dif¬ 
fusion  coefficient.  We  regularize  the  calculation  at  each  step  by 
filtering  in  the  frequency  domain,  using  FFT  algorithms;  we  then 
return  to  the  physical  variables  for  the  calculation  of  the  next 
step.  The  filtering  function  used  at  each  step  is  that  determined 
by  the  related  constant  coefficient  problem.  This  algorithm  is 
outlined  in  [1,  Section  7]. 

In  order  to  test  the  robustness  of  this  procedure,  an  example  was 
manufactured  with  a  known  exact  solution.  This  is  a  fictitious 
mathematical  problem,  artificially  created  so  as  to  have  a  solution 
which  simulates  conditions  presumed  to  exist  in  a  155mm  cannon.  The 
relevant  parameters  were  made  available  to  us  by  Dr.  A.  K.  Cel  mi  ns, 
U.S.  Army  Ballistic  Research  Laboratory,  Aberdeen  Proving  Grounds, 
Maryland.  The  ‘'exact"  solution  was  constructed  numerically  by 
solving  a  well  posed  direct  problem  as  explained  below. 

2.  The  Direct  Problem 

Consider  the  initial  boundary  value  problem 

S  N  3  3 IJ 

(2.1)  -  =  -  [a ( u )  -],  0.<  x  <  t,  t  >  0, 

3t  3X  3  X 

(2.2)  u(O.t)  =  f(t),  u(l,t)  -  h(t),  t  >  0, 

(2.3)  u(x,0)  *  300°  K 
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where  t  is  the  time  measured  in  milliseconds,  x  measured  in 
millimeters  represents  distance  away  from  the  inside  wall,  and 
u(x,t)  is  the  temperature  in  degrees  Kelvin.  The  heat  conduction 
equation  (2.1)  is  a  simplification  of  the  actual  physical  situation, 
in  that  first  order  terms  arising  from  cylindrical  symmetry  have  been 
neglected,  as  well  as  the  variation  of  specific  heat  with  tempera¬ 
ture.  Moreover,  for  gun  steel  at  temperatures  between  300°  K  and 
1000°  K,  the  conduction  coefficient  a(u)  in  (2.1)  is  well  approxi¬ 
mated  by  a  linear  function  of  u, 

(2.4)  a(u)  =  {1.299  -  1.144  x  10‘3  (u  -  255)}  x  10'3  mm2/inillisec 

We  remark  that  the  methodology  to  be  discussed  can  easily  accommodate 
the  more  exact  differential  equation,  as  well  as  more  complicated 
dependencies  of  a(u)  on  u  .  ■  We  shall  refer  to  the  quantity 

3u 

(2.5)  w(x,t)  =  -a(u)  — 

ax 

as  the  temperature  gradient,  by  an  abuse  of  te  mi  oology.  It  is 
measured  in  mm°  K/mil 1 i seconds.  In  all  Figures  shown  below  dealing 
with  plots  of  w(x,t)  as  a  function  of  t  for  some  fixed  x  ,  the 
vertical  axis  bears  the  legend  "temperature  gradient." 

The  functions  f(t)  and  h(t)  in  (2.2)  represent,  respectively,  the 
temperature  histories  at  the  inside  wall  and  at  1mm  away  from  the 
wall.  These  mathematical  functions  are  plotted  in  Figure  1;  they 
are  constructed  so  as  to  approximate  observed  temperature  histories 
in  gun  barrels,  [4]. 
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The  direct  problem  given  by  (2.1),  (2.2),  and  (2.3)  was  solved 
numerically,  using  an  adaptive  partial  differential  equation  software 
package,  M0L1D,  [3].  The  numerical  integration  v/as  carried  out  to  a 
distance  in  time  equal  to  100  milliseconds.  The  temperature  u(x,t) 
and.  gradient  w(x,t)  were  evaluated  at  various  fixed  values  of  x 
as  functions  of  time,  and  stored  for  subsequent  comparisons.  Figures 
2  and  3  show  the  histories  of  u  and  w  at  x  =  .25mm.  As  is 
evident  from  Figure  3,  the  numerical  calculation  of  w  is  not  free 
from  noise.  Nevertheless,  we  use  the  term  "exact  solution"  for  any 
history  obtained  by  the  above  numerical  computation  of  the  direct 
problem.  All  histories  are  records  consisting  of  400  equi spaced 
samples  on  the  time  interval  [0,100]  milliseconds. 

3.  The  Inverse  Problem 

The  physical  region  of  interest  here  is  the  x  interval  between 
0  and  .25mm.  The  histories  in  Figures  2  and  3  simulate  what  might 
have  been  recorded  by  a  thermocouple  at  .25mm  away  from  the  inside 
wall  as  the  shell  is  fired.  The  object  is  to  use  such  data  to  re¬ 
construct  the  temperature  and  gradient  histories,  arbitrarily  close 
to  the  inside  wall.  In  actuality,  two  thermocouple  readings  are 
necessary  at  x  =  xQ  and  x  =  x1#  with  xQ  <  .25  <  x,;  a  well  posed 
direct  calculation,  as  in  Section  2  above,  then  yields  u  and  w  at 
x  *  .25mm.  As  noted  in  the  references  given  in  [1],  this  type  of 
inverse  problem  occurs  in  a  variety  of  heat  transfer  contexts.  The 
purpose  of  our  computational  experiment  is  as  follows: 

a)  To  demonstrate  the  feasibility  of  the  inverse  calculation 
in  a  realistic  situation  in  which  rapidly  varying  solutions 
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and  nonlinearity  play  a  role.  As  may  be  seen  from  Figure  1, 
the  postulated  temperature  at  the  wall  rises  from  300°  K  to 
almost  1000°  K  in  the  first  10  milliseconds.  In  this 
temperature  range,  the  conduction  coefficient  a(u)  given 
by  (2.4)  undergoes  a  280  percent  change. 

b)  To  ctemonsfrate  the  robustness,  of  our  algorithm  with  noisy 
data  and  a  fine  grid. 

c)  The  regularized  marching  procedure  we  shall  use  is  a 
powerful  general  method,  applicable  to  other  ill-posed 
evolutionary  partial  differential  equations,  linear  and 
nonlinear.  As  used  here,  it  is  an  adaptation  to  the  non¬ 
linear  case, of  an  algorithm  which  is  rigorously  justified 

in  the  constant  coefficient  case.  While  the  heuristic 

* 

"local  mode  analysis"  underlying  our  regularization  is 
likely  to  be  valid  in  many  other  cases  of  ill -posed 
initial  value  problems,  there  is  a  need  for  well -documented 
realistic  inverse  calculations. 

Let  z  -  i  -  x  and  let  aQ,  a1  be  positive  constants  such  that 

(3.1)  0  <  aQ  <  a(u)  <  a1  . 

Let  b(u)  =  —  ,  and  let  v(z,t)  =  u(x,t).  Using  (2.5),  we  may  write 
du 

(2.1)  as  an  equivalent  first  order  system 

(3.2)  v  =  ,  w  -  v  ,  0  <  z  <  Z,  t  >  0, 

z  a(v)  z  t 

with  the  subscript  notation  used  for  partial  derivatives. 
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to  be  integrated  in  the  direction  of  increasing  z  from  z  =  0  to 
z  =  i ;  we  use  the  initial  values  given  in  Figures  2  and  3  and  the 
following  boundary  conditions  at  t  =*  0, 

(3.3)  v(z,0)  =  300°  K,  w(z,0)  =  0. 


Let  az  be  the  increment  in  the  z-variable  and  let  i  =  (N  +  1)az. 
Let  vn(t),  wn(t)  denote,  respectively,  v(nAz,t),  w(nAz,t),  for' 

0  <  n  <  M  +  1.  The  following  finite  difference  approximation  is 
second  order  accurate  and  explicit. 


(3.4) 


vn+l(t)  =  vn(t) 


AZ  wn(t) 

a(vn(t) ) 


Az2  vt(t) 
2a(vn(t)) 


az2  b(vn(t))[wn(t)]2 
2a3(vn(t)) 


(3.5) 


wn+l(t)  =  wn(t)  +  azvV)  + 

1* 


AZzW^(t) 

2a(vn(t)) 


Az2  b(vn(t))  wn(t)  v"(t) 

2a2(vn(t)) 

An  effective  way  of  implementing  this  scheme  is  to  use  cubic  spline 

interpolation  at  the  400  equally  spaced  mesh  points  on  the  time 

interval  [0,T].  Differentiating  the  spline  function  produces 

0 (At3 )  accurate  derivatives  v  (t),  w  (t)  at  these  same  mesh  points, 

t  t 

and  hence  vn+l(t),  wn+l(t)  from  (3.4),  (3.5).  The  next  step  is  to 
stabilize  this  process  by  filtering  each  of  these  functions  in  the 
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frequency  domain.  This  is  accomplished  by  dividing  the  kth  Fourier 

coefficient  by  the  precomputed  weight  Xj<,  where 

Az/J> 


(3.6)  X|<  =  (1  +  a  exp  [  i  y 


2  [  k  |  7T 


9nT 


]) 


there  id  =  (-)  is  the  L2  noise  to  signal  ratio.  See  [1]. 
M 


With  Z  =  .25mm,  the  x- interval  [0,£]  was  divided  into  450  equally 
spaced  mesh  points,  and  the  above  procedure  was  implemented  with 
a)  =  .001.  Figures  4  through  11  summarize  the  comparison  between 
exact  and  computed  solutions  at  the  interior  location  x  =*  .056mm. 

An  idea  of  the  relative  errors  in  the  calculation  is  easily  gained 
from  Figures  7  and  11.  Although  the  "logarithmic  convexity" 
estimates  in  Theorem  1  of  [1]  degenerate  at  the  wall,  the  computa¬ 
tion  was  pursued  for  450  cycles  and  approximations  to  the  temperature 
and  gradient  histories  at  the  wall  were  obtained.  These  are  shown 
in  Figures  13  and  17.  The  "exact"  temperature  and  gradient  histories 
at  the  wall  are  shown  in  Figures  12  and  16.  Clearly,  slight  in¬ 
accuracies  in  the  well-posed  direct  calculation  of  u(x,t)  near  x  =  0, 
lead  to  a  rather  noisy  determination  of  the  exact  w(x,t)  at  x  =  0;  in 
particular,  the  pronounced  spike  near  t  =  40  milliseconds  in  Figure 
16  is  a  numerical  artifact  which  should  be  disregarded.  Nonetheless, 
we  have  chosen  to  compare  the  computed  gradient  history  in  Figure  17 
with  the  wall  profile  given  in  Figure  16.  As  is  evident  from  Figures 
15  and  19,  the  wall  estimates  obtained  by  solving  the  inverse  problem 
are  quite  reliable.  This  is  especially  true  during  the  first  twenty 
or  so  milliseconds  where  peak  values  are  achieved. 
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4.  Conclusions 


A  regularized  marching  algorithm  has  been  shown  to  be  effective  in 
solving  nonlinear  inverse  heat  transfer  problems  in  gun  barrels.  In 
[2],  a  similar  technique  was  used  successfully  on  linear  backwards 
parabolic  equations  with  highly  variable  coefficients.  More 
recently,  success  has  also  been  achieved  on  other  unstable  examples 
involving  Burgers'  equation  with  the  time  direction  reversed. 

Future  work  should  be  directed  towards  problems  in  two  or  more  space 
dimensions  in  general  domains,  in  the  context  of  heat  transfer  and 
fluid  mechanics. 
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ABSTRACT 

Thi.3  paper  gives  guidelines  for  the  development  of  computer  programs  for 
the  numerical  simulation  of  semiconductor  devices.  For  this  purpose  the  basic 
mathematical  results  on  the  corresponding  elliptic  boundary  value  problem  are 
reviewed.  In  particular,  existence,  smoothness  and  structure  of  the  solutions 
of  the  fundamental  semiconductor  equations  are  discussed.  Various  feasible 
approaches  to  the  numerical  solution  of  the  semiconductor  equations  are  described. 
Much  emphasis  is  placed  on  constructive  remarks  to  help  authors  of  device  Simula- 
tion  programs  to  make  decisions  on  their  code  design  problems.  In  particular, 
CI,i^e^ia  for  an  optimal  mesh  generation  strategy  are  given.  The  iterative  solution 
of  the  systems  of  nonlinear  and  linear  equations  obtained  by  discretising  the  semi¬ 
conductor  equations  is  discussed.  An  example  is  given  showing  the  power  of  these 
concepts  combined  with  modern  numerical  methods  in  comparison  to  classical  approaches. 
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ABOUT  THE  SEMICONDUCTOR  EQUATIONS  ON  DEVICE  MODELING  PROGRAMS 

*  ** 

Ch.  Ringhofer  and  S.  Selberherr 


1.  Introduction 


The  characteristic  feature  of  early  device  modeling  is  the 
separation  of  the' interior  of  the  device  into  different  regions, 
the  treatment  of  which  could  be  simplified  by  various  assumptions 
like  special  doping  profiles,  complete  depletion  and 
quasineutrality.  These  separately  treated  regions  were  simply 
put  together  to  produce  the  overall  solution.  If  results  in  an 
analytically  closed  form  are  intended,  any  other  approach  is 
prohibitive.  Fully  numerical  modeling  based  on  partial 
differential  equations  /6 1/  which  describe  all  different  regions 
of  semiconductor  devices  in  one  unified  manner  was  first 
suggested  by  Gununel  /29/  for  the  one  dimensional  bipolar 
transistor.  This  approach  was  further  developed  and  applied  to 
pn-junction  theory  by  De  Mari  /13/,  /14/  and  to  IMPATT  diodes  by 
Schar fetter  and  Gummel  /50/. 

A  two  dimensional  numerical  analysis  of  a  semiconductor 
device  was  carried  out  first  by  Kennedy  and  O'Brien  /35/  who 
investigated  the  junction  field  effect  transistor.  Since  then 
two  dimensional  modeling  has  been  applied  to  fairly  all  important 
semiconductor  devices.  There  are  so  many  papers  of  excellent 
repute  that  it  would  be  unfair  to  cite  only  a  few.  Recently  also 
the  first  results  on  three  dimensional  device  modeling  have  been 
published.  Time  dependence  has  been  investigated  by  e.g.  /37/, 
/44/  and  models  in  three  space  dimensions  have  been  announced  by 
e.g.  /8/ ,  /XI/,  /67/,  /68/. 
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In  spite  of  all  these  important  and  successful  activities, 
the  need  for  economic  and  highly  user  oriented  computer  programs 
became  more  and  more  apparent  in  the  field  of  device  modeling* 
Especially  for  MOS  devices  which  have  evolved  since  their 
invention  by  Kahng  and  Atalla  /32/  to  an  incredible  standard, 
modeling  in  two  space  dimensions  has  become  inherently  important 
because  current  flow  controlled  by  a  perpendicular  field  is  an 
intrinsically  two  dimensional  problem.  One  such  program  which 
has  beer*  applied  successfully  in  many  laboratories  is  called 
CADDET  /59/.  We  have  also  tried  to  bridge  that  gap  and  developed 
MINIMOS  /53/,  /51/  for  the  two  dimensional  static  analysis  of 
planar  MOS  transistors. 

2.  Analysis  of  the  Static  Semiconductor  Equations 

In  this  chapter  we  review  some  of  the  existing  analytical 
results  for  the  fundamental  semiconductor  equations  concerning 
existence  and  structure  of  their  solutions.  These  results  are  of 
importance  in  both  the  theoretical  and  practical  context,  since  - 
as  we  will  see  in  the  next  chapter  -  the  knowledge,  of  the 
structure  and  smoothness  properties  of  solutions  is  indeed 
essential  for  the  development  of  a  numerical  solution  method. 
The  most  familiar  model  of  carrier  transport  in  a  semiconductor 
device  has  been  proposed  by  Van  Roosbroeck  /61/.  It  consists  of 
Poisson's  equation  (2.1),  the  current  continuity  equations  for 
electrons  (2.2)  and  holes  (2.3)  and  the  current  relations  for 


electrons  (2.4)  and 

holes  (2.5) 

div  €*grad  =  -q*  ( 

p  -  n  +  C  ) 

(2.1) 

* 

(2.2) 

div  Jn  =  -q* R 

div  -  q-R 

P 

(2.3) 

=  -q*  (  J»n*n-grad 

-  D^^grad  n  ) 

(2.4) 

Jp  =  -q* (  Pp-p-gtad 

V  +  D^grad  p  ) 

in 

CM 
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These  relations  form  a  system  of  coupled  partial 
differential  equations.  Poisson's  equation,  coming  from 
Maxwell's  laws,  describes  the  charge  distribution  in  the  interior 
of  a  semiconductor  device*  The  balance  of  sinks  and  sources  for 
electron-  and  hole  currents  is  characterized  by  the  continuity 
equations.  The  current  relations  describe  the  absolute  value, 
direction  and  orientation  of  electron-  and  hole  currents*  The 
continuity  equations  and  the  current  relations  can  be  derived 
from  Boltzmann's  equation  by  not  at  all  trivial  means.  It  is  not 
our  intention  to  present  in  this  paper  the  ideas  behind  these 
considerations*  The  interested  reader  is  refered  to  /61/  and  its 
secondary  literature  or  text  books  on  semiconductor  physics  e.g. 
/7/,  /31/ ,  /52/ ,  /56/ • 

2.1  The  Validity  of  the  Basic  Semiconductor  Equations 

It  is  of  prime  importance  to  be  aware  that  equations  (2.4) 
and  (2.5)  are  not  capable  to  describe  exactly  all  phenomena 
occuring  in  real  devices.  For  instance,  they  do  not  characterize 
effects  which  are  caused  by  degenerate  semiconductors  (e.g.  heavy 
doping) .  /38/,  /60/,  /63/  discuss  some  modifications  of  the 
current  relations,  which  partially  take  into  account  the 
consequences  introduced  by  degenerate  semiconductors  (e.g. 
invalidity  of  Boltzmann's  statistics,  bandgap  narrowing).  These 
modifications  are  not  at  all  simple  and  lead  to  problems 
especially  in  the  formulation  of  boundary  conditions  /47/,  /62/. 
In  case  of  modeling  .  MOS  devices,  degeneracy,  owing  to  the 
relatively  low  doping  in  the,  channel  region,  is  practically 
irrelevant.  For  modern  bipolar  devices,  though,  bearing  in  mind 
shallow  and  extraordinarily  heavily  doped  emitters,  it  is  an 
absolute  necessity  to  account  for  local  degeneracy  of  the 
semiconductor. 

Just  as  further  examples  (2.4)  and  (2.5)  do  not  describe 
velocity  overshoot  phenomena  which  become  apparent  at  feature 
lengths  of  O.ljhn  for  silicon  and  IJJm  for  gallium-arsenide  /25/. 
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Certainly  no  effects  which  are  due  to  ballistic  transport  (the 
existence  of  which  is  still  questionable  /30/)  are  included.  The 
latter  start  to  become  important  for  feature  sizes  below  O.OlJ/m 
for  silicon  and  O.l^m  for  gallium-arsenide  /26/.  Considering  the 
state  of  the  art  of  device  miniaturization,  neither  effect  has  to 
bother  the  modelists  of  silicon  devices.  For  gallium-arsenide 
devices  new  ideas  are  mandatory  in  the  near  future  /25/,  /46/, 
/  45/ . 


2.2  Domain  and  Boundary  Conditions 

Most  of  the  existing  programs  which  solve  the  semiconductor 
equations  are  restricted  to  a  rectangular  device  geometry*  This 
is  not  essential  as  far  as  the  analysis  of  the  equations  is 
concerned.  In  this  chapter  we  shall  assume  that  the  equations 
( 2 . 1 ) - ( 2 . 5 )  are  posed  in  a  domain  D  of  Rn  (n=l,2,3)  with  a 
piecewise  smooth  boundary  3d.  Equations  (2.1)-(2.5)  are  subject 
to  a  mixed  set  of  Dirichlet  and  Neumann  boundary  conditions. 
That  means  3d  consists  of  three  parts  .  3d. 

denotes  the'  part  of  the  boundary  where  the  device  is  surrounded 
by  insulating  material.  There  one  assumes  the  boundary 
conditions : 

3*P/3nJj=  3n/3nX=  3p/3nJj=  0  (2.6) 

Here  nj_  denotes  the  unit  normal  vector  on  3d  which  exists 
anywhere  except  at  a  finite  number  of  points  (arbitrarily  defined 
corners  of  the  simulation  geometry) .  3d2  denotes  the  part  of  the 

boundary  corresponding  to  the  ohmic  contacts.  There  q#,  n  and  p 
are  prescribed.  The  boundary  conditions  can  be  derived  from  the 
applied  bias  and  the  assumptions  of  thermal  equilibrium  and 
vanishing  space  charge: 

*  =  %  +  %uilt-in'  n’?  =  ni2'  n  “  P  "  c  K  0  <2-7> 

The  last  two  conditions  in  (2.7)  can  be  rewritten  as: 
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(2.8) 


n  «  C2+4*n.2  +  C)/2 

p  *  (i|  C2+4*n^2  -  C)/2 

In  many  applications  it  is  desired  to  consider  controlled 
insulator-semiconductor  interfaces  (e,g,  MOS  devices) ,  So  3d^ 
denotes  the-  part  of  the  boundary  which  corresponds  to  such  an 
interface.  There  we  have  the  interface  conditions: 


(2,9) 


Again  nl  denotes  the  normal  vector  on  3d.  C  and  _ 
7  sem  ins 

denote  the  permittivity  constants  for  the  semiconductor  and  the 

insulator  respectively.  34>/3n||  and  SV/Dnl  |  .  denote  the 

‘sem  ‘ins 

onesided  limits  of  the  derivatives  perpendicular  to  the  interface 
approaching  the  interface.  Within  the  insulator  the  Laplace 
equation:  div  grad  ^  =  0  holds. 


2.3  Dependent  Variables 

For  analytical  purposes  it  is  often  useful  to  use  other 
variables  than  n  and  p  to  describe  the  system  (2.1)— (2-5) *  Two 
other  sets  of  variables  which  are  frequently  employed  are 
and  W,u,v)  which  relate  to  the  set  Wn,p)  by: 


n  =  n.-e^nJ/Ut,  p  =  n.  • ,  /Ut 


n  -  n^e^^-u,  p  s  n^e 


(2.10) 

(2.11) 


(2.10)  can  be  physically  interpreted  as  the  application  of 
Boltzmann  statistics.  However  (2,10)  also  can  be  regarded  as  a 
purely  mathematical  change  of  variables  so  that  the  question  of 


360 


the  validity  of  the  Boltzmann  statistics  does  not  need  to  be 
considered.  The  use  of  (V*Vn/Vp)  a  priori  excludes  negative 
carrier  densities  n  and  p ,  which  may  .be  present  as  undesired 
nonphysical  solutions  of  (2.1) -(2.5)  if  we  use  (H>/n,p)  or  (q*,u,v) 
as  dependent  variables.  As  we  will  see  later  in  this  chapter  the 
advantage  of  the  set  (qf,  u,v)  is  that  the  continuity  equations 
(2.2) ,  (2.3)  and  current  relations  (2.4) ,  (2.5)  become 

self-adjoint.  This  also  has  an  important  impact  on  the  use  of 
iterative  schemes  for  the  solution  of  the  evolving  linear  systems 
(cf.  chapter  4).  However ,  owing  to  the  enormous  range  of  the 
values  of  u  and  v,  the  sets  (q*,  n,p)  or  )  have  to  be 

prefered  for  actual  computations.  We  personally  favour  the  set 
(V,n,p). 

2.4  The  Existence  of  Solutions  and  Scaling 

The  basic  answer  to  the  question  of  existence  of  solutions 
can  be  found  in  Mock  /43/  or  under  slightly  different  assumptions 
in  Bank,  Jerome  and  Rose  /5/.  Both  proofs  are  based  on 
Schauder's  fixpoint  theorem.  They  are  both  valid  for  arbitrarily 
shaped  domains  and  boundary  conditions  of  the  type  previously 
described  without  an  interface  (3d3={}).  Both  papers  consider 
the  case  of  vanishing  generation/recombination  rate  (R=0  in 
(2.2),  (2.3)).  In  the  setting  of  Mock  (q»,u,v)  is  used  as 

dependent  variables.  The  equations  are  scaled  so  that  the 

intrinsic  carrier  density  n^,  the  thermal  voltage  Ut  and  the 

ratio  elementary  charge/permittivity  are  equal  to  unity.  Thus, 
combining  the  continuity  equations  (2.2),  (2.3)  and  current 

relations  (2.4),  (2.5),  we  have  the  system: 

div  grad  H*  *  e^*u  -  e*^*v  -  c  (2.12) 

div  (e^-grad  u)  =  0  (2.13) 

div  (e“^-grad  v)  =  0  (2.14) 
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Then  a  map  M:4*  ♦  y  is  defined  (details  in  /43/  or  /4/)  such 
that  the  evaluation  of  M  requires  the  solution  of  (2.13)  and 
(2.14)  and  a  fixpoint  4>*  of  M  (M^Vh1*)  together  with  the 
according  functions  (u,v)  is  a  solution  of  the  whole  system 
(2. 12) - (2. 14 ) .  The  existence  of  a  fixpoint  is  shown  by 
Schauder's  fixpoint  theorem.  Questions  concerning  the  degree  of 
smoothness  of  these  solutions  (the  existence  of  derivatives)  are 
discussed  in  /42/, 

However,  Schauder’s  theorem  is  not  constructive  and  does  not 
indicate  that  iterating  the  map  M  will  actually  lead  to  the 
fixpoint.  Moreover,  it  does  not  give  any  information  about  the 
structure  of  the  solution  which  is  of  vital  interest  for  actual 
computations.  Since  the  dependent  variables  in  the  system 
(2.1) -(2.5)  are  of  different  order  of  magnitude  and  show  a 
strongly  different  behaviour  in  regions  with  small  and  large 
space  charge  the  first  step  towards  a  structural  analysis  of 
(2.1) -(2. 5)  has  to  be  an  appropriate  scaling.  A  standard  way  of 
scaling  (2.1)-(2.5)  has  been  given  by  De  Mari  /14/.  There  4*  is 
scaled  by  the  thermal  voltage  (J^/  n  and  p  are  scaled  by  n ^ 
(similar  to  Mock  /43/)  and  the  independent  variables  are  scaled 
such  that  all  multipying  constants  in  Poisson's  equation  become 
unity.*  Although  physically  reasonable  this  approach  has  the 
disadvantage  that  n  and  p  in  general  are  still  several  orders  of 
magnitude  larger  than  4*.  A  scaling  which  reduces  V/  n  and  p  to 
the  same  order  of  magnitude  has  been  given  by  Vasiliev'a  and 
Butuzov  /65/.  This  approach  makes  the  system  (2.1) -(2.5) 
accessible  to  an  asymptotic  analysis  which  is  given  together  with 
applications  in  /AO/,  /AX/  and  /39/.  There  n  and  p  are  scaled  by 
the  maximum  absolute  value  of  the  net  doping  C  and  the 
independent  variables  are  scaled  by  the  characteristic  length  of 


the  device. 

employed . 

More 

precisely  the  following  scaling 

factors  are 

quantity 

mX 

symbol 

value 

-v  _ 

X 

1 

max(x-y) ,  x,y  m  D 

s» 

«t 

k-T/q 

(2.15) 

n,p 

<£ 

max | C | 
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After  scaling  the  equations  become: 

X2*div  grad  =  n  -  p  -  C  (2.16) 

div  (  grad  n  -  n-grad  )  -  -R 
div  (  grad  p  +  p-grad  ^  )  =  -R 


following  analysis  also 

smooth  dependence  of  p  and  p  on  n,  p  and 

n  p  * 


Here,  for  simplicity  only,  p^  and  J*  have  been  assumed  to  be 
constant.  It  should  be  noted  that  the 
holds  if  the  usual 
grad  H*  e.g./54/  is  assumed.  Since  the  independent  variable  x  has 
been  scaled,  equations  (2.16)  are  now  posed  on  a  domain  Ds  with 
maximal  diameter  equal  to  one.  The  small  constant  X2  multiplying 
the  Laplacian  in  (2.16)  is  the  minimal  Debye  length  of  the 
device: 


z  •Ot 

l2*q*<f 


(2.17) 


1  and  tf  are  defined  in  (2.15)  .  Thus  for  high  doping  («(>>1) 
X  will  be  small.  For  instance  for  a  silicon  device  with 
characteristic  length  25J>m  and  «£=1020cnf3  we  compute  for  X2  at 
approximate  room  temperature  T=300K:  X2=4.10-10. 


R  denotes  again  the  scaled  generation/recombination  rate. 
In  the  analysis  given  in  /41/  the  usual  Shockley-Read-Hall  term 
has  been  used  which  after  scaling  is  of  the  form: 


R  =  n"P  z 

n  +  P  +  2-  [rX)2 


r»  1/2 


(2.18) 


R  is  in  general  a  (not  necessarily  mildly)  nonlinear 
function  of  n,p  and  gradtp.  Thus  different  models  of  R  may 
influence  the  analytical  results  quite  drastically.  This  is 
obviously  to  be  expected  as  in  many  operating  conditions  the 
device  behaviour  depends  *  strongly  on  the  net 

generation/recombination  R. 
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2.5  The  Singular  Perturbation  Approach 

(2.16)  represents  a  singularly  perturbed  elliptic  system 
with  perturbation  parameter  X.  The  advantage  of  this 
interpretation  is  that  we  can  now  obtain  information  about  the 
structure  of  solutions  of  (2.16)  by  using  asymptotic  expansions: 
In  the  subdomains  of  Ds  where  the  solutions  behave  smoothly  we 
expand  them  into  power  series  of  the  form: 

oo 

w(x,X)  =  2w7 (x) -X*,  w=(V,n,p)T  (2.19) 

i=0 

which  implies  a  smooth  dependence  on  X.  C  -  the  scaled  doping  - 
is  smooth  in  these  subdomains  and  exhibits  a  sharp  transition 
across  the  pn-junctions  in  the  device.  For  the  case  of  an  abrupt 
junction  this  behaviour  is  represented  by  a  discontinuity  across 
an  n-1  dimensional  manifold  P:(x=x(s)r  s  of  Rn  in  the  device. 
Thus  P  is  a  point  in  1  dimension,  a  curve  in  2  dimensions  and  a 
surface  in  3  dimensions.  Of  course  one  curve  or  surface  has  to 
be  used  for  each  junction.  Since  the  procedure  is  the  same  for 
each  of  the  junctions  it  is  demonstrated  only  for  one  junction. 
In  the  case  of  an  exponentially  graded  doping  profile  C  consists 
of  two  parts: 

C  =  C“  +  C~  (2.20) 

where  C"and  C*  are  discontinous ,  C~  is  piecewise  smooth  and  C  is 
exponentially  decaying  to  zero  away  from  P„  In  the  vicinity  of  P 
the  expansion  (2.19)  is  not  valid  and  has  to  be  supplemented  by  a 
"layer"  term  acording  to  the  singular  perturbation  analysis: 

oo 

w(x,X)  =E[w^(x)  +  w7  (s,  t/X)  ]  •X*',  w-0P,n,p)T  (2.21) 

i=0 
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Here  the  following  coordinate  transformation  has  been 

employed:  For  a  point  in  the  vicinity  of  P  s  denotes  the 

parameter  value  at  the  nearest  point  on  P  and  t  denotes  its 
distance  perpendicular  to  P  (cf.  Fig*  1).  Thus  the  solution  of 
the  semiconductor  equations  exhibits  internal  layers  at 
pn- junctions. 

The  wT  and  in  (2.21)  can  now  be  determined  separately  and 
the  structure  of  the  solution  is  given  by  its  partition  into  the 
smooth  part  EwT'A1  and  its  rapidly  varying  part  has 

to  satisfy  the  reduced  equations: 

0  K  no  ~  po  (2*22) 

aiv  (grad  n“  -  n“-grad*}T)  =  -R~  (2.23) 

div  (grad  p~  +  p“‘grad^T)  =  -r“  (2.24) 

For  the  sake  of  simplicity  but  without  loss  of  generality 
the  mobilities  p^  and  p^  have  been  assumed  to  be  constant. 
(2.22) -(2.24)  is  subject  to  the  boundary  conditions  (2.6)-{2.9). 
Of  course  the  condition  of  vanishing  space  charge  is  redundant 
with  (2.22).  Since  C“  is  discontinous  at  P  and  (2 . 22) - (2 . 24) 
represents  a  second  order  system  of  two  equations  four  "interface 
conditions"  have  to  be  imposed  at  P.  They  are  of  the  form: 


1  x=x+ 

(2.25) 

=  Po*e<&|.»  -» 

1  x=x+ 

(2.26) 

a 

J  ^  II 

=  S«o-ll 

(2.27) 

p°'nllx=i- 

= 

(2.28) 

where  w|-  and  w|«*  denote  the  onesided  limits  of  w  as  x  tends  to 

A  a  *  ^ 

P  from  each  side.  nl  denotes  the  unit  normal  vector  on  P.  J" 

**  no 

and  are  the  zeroth  order  terms  of  the  smooth  parts  of  the 

Po 

(scaled)  electron  and  hole  current  densities. 
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(2.29) 


J„0  *  9rad  no  “  nj'grad  «}r 

Jp  •“  grad  p3  +  Po'grad  qr 

(2. 22)-(2.24)  together  with  (2. 25) - (2.28)  and  the  boundary 
conditions  (2. 6) -(2. 9)  define  the  reduced  problem  whose  solution 
is  an  0(X)  approximation  to  the  full  solution  away  from  P.  As  we 
will  see  in  the  next  chapter  the  reduced  problem  is  a  useful  tool 
for  the  development  and  analysis  of  numerical  methods ,  since  it 
(especially  the  conditions  (2.25) -(2.28) )  has  to  be  solved 
implicitly  by  any  discretisation  method  which  requires  a 
reasonable  number  of  grid  points. 

The  equations  for  the  rapidly  varying  parts  w?  reduce  to 
ordinary  differential  equations.  That  means  that  only 
derivatives  with  respect  to  the  "fast"  variable  t/X  occur.  Since 
the  rate  of  decay  of  wt  depends  heavily  on  ^  the  width  of  the 
layer  grows  with  the  applied  voltage;  a  fact  which  is  absolutely 
well  known  by  device  physicists,  but  which,  becomes  nicely 
apparent  by  the  singular  perturbation  approach. 

3.  Numerical  Solution  of  the  Semiconductor  Equations 

In  this  chapter  we  discuss  some  of  the  problems  occuring  in 
the  numerical  solution  of  the  semiconductor  equations  and  the 
analysis  of  existing  numerical  methods.  From  the  viewpoint  of 
numerical  analysis  there  are  essentially  four  major  topics  to  be 
considered.  The  first  one  is  the  type  of  discretisation  to  be 
used.  There  exist  programs  for  both  Finite  Element  and  Finite 
Difference  discretisations  of  the  system  (2.1)-(2.5).  As 
outlined  in  the  previous  chapter  the  solution  exhibits  a  smooth 
behaviour  in  some  subregions  of  the  domain  whereas  in  others  it 
varies  rapidly.  Thus  a  nonuniform  mesh  is  mandatory  and  adaptive 
mesh  refinement  is  desirable.  So  the  second  topic  is  the 
question  how  to  set  up  the  mesh  refinement  algorithm  i.e.  which 
quantities  have  to  be  used  to  control  the  mesh.  Each  type  of 
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discretisation  will  lead  to  a  large  sparse  system  of  nonlinear 
equations  and  so  the  solution  of  this  system  is  the  third  topic. 
As  fourth  topic  we  discuss  linear  equations  solvers  which  have  to 
be  used  in  topic  three*  For  topics  one  to  three  many  methods 
have  been  designed  especially  for  the  semiconductor  equations. 
These  points  will  be  discussed  in  this  chapter.  For  topic  four 
standard  numerical  analysis  is  commonly  used  and  so  its 
discussion  will  be  deferred  to  chapter  four.  For  the  sake  of 
simplicity  in  nomenclature  we  shall  only  consider  the 
two-dimensional  case  in  this  chapter.  However,  all  results  given 
in  the  following  can  be  generalized  to  three  dimensions  in.  a 
straightforward  manner.  So,  the  equations  are  posed  in  a  domain 
D  of  E2  and  x  *  (x,y)T  denotes  the  independent  variable. 


3.1  Discretisation  Schemes 

Using  Finite  Elements  or  Finite  Differences  one  has  to  take 
into  account  that  Poisson's  equation  (2.1)  is  of  a  different  type 
than  the  continuity  equations.  Poisson's  equation  -  in  the 
scaling  of  Markowich  /4Q/  using  the  variables 

X^*div  grad  -  e^*u  -  e”^*v  -  C  (3.1) 

is  a  singularly  perturbed  elliptic  problem  whose  right  hand  side 
has  a  positive  derivative  with  respect  to  Thus  it  is  of  a 

standard  form  (as  discussed  in  e.g.  /22/)  except  for  the 
discontinous  or  exponentially  graded  term  C.  Equations  of  that 
type  are  generally  well  behaved  and  it  suffices  to  apply  a  usual 
discretisation  scheme*  In  the  case  of  Finite  Differences 
equation  (3.1)  is  discretized  by: 

X2*  (div  gradh'Wi^  =  -  C(xA,y^)  <3. 2) 
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Ei+l/2,j  * 

EI,j+l/2  - 

hi  -  *i+rxi*  kj  =>j+i-yj 
(div  grad  S*i#j  =  2*C^+1/2,3 

+2-<Ei,j+l/2 


(3.3) 


Ei,j-l/2)/(kj+kj-l) 


(3.4) 


Here  ^  and  p^  denote  the  approximations  to  <4>,  n  and 
p  at  the  gridpoint  (x^y^)  .  Ei+i/2,j  denotes  the  value  of  3h»/Ux 
at  (xi+1/2B(xi+xi+1)/2,  Yj )  -  Ei,j+i/2  denotes  the  value  of  3H>/3y 
at  (x.,  yj+i/2=(yj+yj+l)/2) •'  If  on*  of  the  neighbouring 
gridpoints  (xi+1.yj)r  (xi_1,yj)  ,  (xifyj+1)  ,  does  not 
exist  -  as  possible  in  a  terminating  line  approach  /!/ ,  /2/  or  in 
the  Finite  Boxes  approach  /24/  -  (3,4)  has  to  be  modified.  We 
will  go  into  some  detail  concerning  these  modifications  in  the 
next  section.  In  the  case  of  Finite  Elements  classical  shape 
functions  can  be  used  (i.e.  linear  shape  functions  for  triangular 
elements ,  bilinear  shape  functions  for  rectangular  elements) . 

It  turns  out  that  the  discretisation  of  the  continuity 
equations  is  more  crucial  than  the  discretisation  of  Poissons 's 
equation.  The  usual  error  analysis  of  discretisation  methods 
provides  an  error  estimate  of  the  form: 


max  |wh~w|  <*  c-H  (3,5) 

T 

wh  denotes  the  numerical  approximation  to  w (x,y)  -  (H*,  n , p)  - 
H  denotes  the  maximal  gridspacing.  The  constant  c  will  in 
general  depend  on  the  higher  -order  derivatives  of  w.  The 

singular  perturbation  analysis  /41/  shows  that  derivatives  of  ^ , 
n~  and  pA  in  (2.21)  are  of  magnitude  0(X  3)  -  0(X  4)  locally  near 

the  junction  (X  is  defined  in  (2.17)).  /41/  shows  also  that, 

even  if  a  nonuniform  mesh  is  used,  the  amount  of  gridpoints 
required  to  equidistr ibute  the  error  term  in  (3-5)  can  be 

proportional  to  X~2  which  is  of  course  prohibitive.  Therefore  a 
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discretisation  scheme  is  needed  where  the  constant  c  in  (3.5) 
does  not  depend  on  the  higher  derivatives  of  the  rapidly  varying 
terms  qr,  n*  and  pA.  For  the  case  of  Finite  Differences  such  a 
scheme  was  given  by  Scharfetter  and  Gummel  /50/*  They 
approximate: 

Jn  =  grad  n  -  n*grad  V  (3.6) 
div  Jn  -  3j*/3x  +  3 J^/3y  =  R  (3.7) 
by: 


^(s)  =  s-coth(s) 

tX 


2’(Jni+l/2,j  “  Jni-i/2f j)/(hi+hi-i>  + 

+  2* {J"i» j+1/2  "  Jni,j-i/2)/(kj+kj-i)  =  Ri,j 


(3.8) 


(3.9) 


J*i+l/2,j  denotes  the  value  of  J*  at  (xi+1 ,2= (^i+xi+1) /2 , 
Yj)*  Jni,j+l/2  denotes  the  value  of  at  {xi , 
y j+l/2= (y j+y j+i) /2) .  The  continuity  equation  for  holes  is 
discretized  analogously,  Scharfetter  and  Gummel  gave  a  physical 
reasoning  for  the  derivation  of  their  scheme.  Markowich  et  al. 
/41/  proved  that  in  one  dimension  the  Scharf etter-Gummel  scheme 
is  uniformly  convergent.  That  means  that  the  error  constant  c  in 
(3.5)  does  not  depend  on  the  derivatives  of  qT ,  nA  and  p~  in 
(2.21)  and  therefore  not  on  X.  For  two  dimensions  /41/  shows 
that  the  choice  y(s)  *  s*coth(s)  is  necessary  for  uniform 
convergence.  Exponentially  fitted  schemes  like  the  Scharf etter- 
Gummel  scheme  have  been  analyzed  by  Kellog  /34/,  /33/  and  Doolan 
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/17 /  (for  different  classes  of  problems) .  The  reason  for  the 
uniform  convergence  of  these  schemes  is  that  inside  the 
pn-junction  layers  the  interface  conditions  (2,25)  and  (2.26)  are 
satisfied  automatically  if  |grad*p|  is  large  and  the  gridspacing 
is  not  0  (X)  . 

The  results  for  Finite  Difference  schemes  suggest  that  a 
similiar  approach  (like  the  exponentially  fitted  schemes)  should 
be  used  in  the  case  of  Finite  Elements.  This  fact  has  been 
intuitively  observed  by  Engel  /21/  for  the  one-dimensional  case. 
A  modeling  group  at  IBM  has  tried  to  make  use  of  the  Scharfetter- 
Gummel  scheme  for  Finite  Elements  in  two  and  three  space 
dimensions  /9/,  /8/f  /12/.  However,  we  have  the  impression  that 
their  approach  needs  still  quite  a  bit  of  analysis,  although  it 
has  been  used  effectively  by  other  modelists  too  e.g.  /49/. 
Macheck  /36/  has  tried  to  develop  a  more  rigorous  discretisation 
for  Finite  Elements  using  exponentially  fitted  shape  functions. 
He  uses  classical  bilinear  shape  functions  for  and 

*l(x,y)  *  tl  -  Vi(x,y)]*[l  -  (3.11) 

<f2<*,y)  =  Vl(x»y)  -  V2<*,y)] 

*3(x,y)  =*  9l(x,y)  *  <f>2(x,y) 

*4 (x,y)  =  (1  -  Vl(x,y)l-  V^(x,y) 

for  u,  and 

*l<x,y)]*[l  -  <^2(x,y)]  (3.12) 

^l(x,y)  • [1  -  *2 (x,y) ] 

<ri(x,y)  •  *2  (x.,y) 

*l(x,y)  3 *  <r2  (xf y) 

for  v,  where 


Pi (x,y)  -  [1  - 
?2  (x ,y)  - 
p3<x,y)  = 

P4 (x,y)  -  (1  * 
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(3.13) 


Viix.y)  =  f(x,g^) 

V2(x,y)  =  f(y,^) 

»l(x»y)  =  f(x,-  J^) 

<T2(x,y)  “  f(yr-  |^) 

with;  f(x,a)  =  (exp (ax) -1) /(exp (a) -1)  (3.14) 

The  advantage  of  these  shape  functions  is  that  they 

accomodate  nicely  the  layer  behaviour  of  the  solution.  They 
degenerate  into  the  ordinary  bilinear  shape  functions  when  the 
electric  potential  is  constant.  In  order  to  be  able  to  switch 
from  coarse  to  fine  grid  spacing  in  different  subdomains 
transition  elements  have  to  be  used  (as  outlined  in  the  next 
section).  However,  no  theoretical  investigations  have  been 

carried  out  so  far  to  analyse  the  uniform  convergence  properties 
of  this  method.. 

3.2  Grid  Construction 

Since  subregions  of  strong  variation  of  tp,  n  and  p  alternate 
with  regions  where  these  quantities  behave  smoothly  (i.e.  their 

gradients  are  small)  different  meshsizes  are  mandatory  in  these 

subregions.  Thus  the  discretisation  scheme  should  be  able  to 
switch  locally  from  a  coarser  to  a  finer  grid.  For  the 
exponentially  fitted  (Schar f etter-Gumrael)  Finite  Difference 
discretisation  schemes  this  is  done  by  the  Finite  Boxes  approach 
/24/ .  Grid  lines  can  terminate  when  the  mesh  is  likely  to  be 
coarsened  (cf.  Fig. 2,).  The  point  (x^+^,yj)  does  not  belong  to 
the  mesh.  Thus  the  equations  for  the  point  (x. ,y.)  have  to  be 

modified  since  ni+i,j  and  pi+l,j  are  not  available.  This 

is  done  by  proper  interpolation  between  the  (j-l)-st  and  (j+l)-st 
y-level.  So  (div  grad  ^  is  approximated  by: 
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(div  grad  .  •  * 

1  F  J 

=  2*<<kj-l,Ei+l/2,j+l  +  kj'Ei+l/2,j-l>/(kj+kj-l) 

-  Ei-l/2,j)/(hi+hi-l>  + 

+  2*<Ei,j+l/2  ‘  Ei,j-l/2)/(kj+kj-l)  (3'15) 

in  (3.3).  The 


Ei-l/2,j'  •  Ei,j+l/2 


etc,  are 


defined 


continuity  equations  are  approximated  by: 


2-<<kj-l-JSi+l/2,j+l  +  kj 


-  J* 


ni-1/2, j)/(hi+hi-l) 


■J*i+l/2,j-l)/<kj+kj-l) 


2’(Jni,j+l/2  "  Jni,j-l/2)/(kj+kj-l1  “  Ri,j 


(3. IS) 


J*i-l/2  j'  j+1/2  etc*  are  define<a  in  (8«8)-  For  reasons 
of  numerical  stability  only  one  gridline  is  allowed  to  terminate 
at  a  box.  This  approach  is  a  generalisation  of  the  "Terminating 
Line"  approach  introduced  by  Adler  /!/ ,  /2/  as  already  mentioned. 


In  the  Finite  Element  approach  of  Macheck  /36/  transition 
elements  composed  of  three  triangles  are  used  to  coarsen  the  mesh 
locally  (cf.  Fig. 3).  Within  these  triangles  a  different  set  of 
shape  functions  has  to  be  used.  Thev  are  derived  by  holding  the 

j  ji 

current  densities  J  and  J  constant  along  the  edges  of  a 

n  p 

triangle  similar  to  the  approach  of  /10/. 

In  the  Finite  Element  as  well  as  in  the  Finite  Difference 
(Boxes)  approach  the  question  arises  which  criteria  should  be 
used  to  generate  the  mesh.  If  the  user  of  a  simulation  program 
has  to  define  his  elements  or  nodes  a  priori  as  input  parameters, 
this  could  perhaps  be  done  by  experience  /IQ/*  However,  if  -  as 
it  is  the  case  for  modern  user  oriented  programs  -  an  adaptive 
mesh  selection  is  desired  mathematically  formulated  criteria  are 
a  "sine  qua  non".  Generally  such  criteria  should  satisfy  two 
conditions.  Firstly  they  should  not  cause  the  program  to 
construct  more  gr idpoints/elements  than  necessary  to  achieve  a 
certain  accuracy.  Secondly  they  should  guarantee  that  a 


372 


prescribed  relative  accuracy  <f  is  really  achieved  once  they  are 
satisfied*  A  usual  way  to  design  adaptive  mesh  refinement 
procedures  is  to  equidistr ibute  the  local  truncation  error  of  the 
discretisation  scheme.  In  the  case  of  Finite  Differences  this 
error  is  proportional  to  the  meshsize  and  the  third  and  fourth 
derivatives  of  qj,  n  and  p.  Markowich  /41/  however  showed  that  it 
is  practically  not  possible  to  equidistr ibute  this  quantity.  In 
the  case  of  a  simple  MOS-transistor  0«f  X~  )  gridpoints  would  be 
required.  On  the  other  hand  the  singular  perturbation  analysis 
shows  that  the  solution  of  the  difference  scheme  approximates  the 
solution  of  the  reduced  problem  (2.22) - (2, 24)  even  if  this 
criterion  is  not  satisfied  inside  the  layer  regions  (inversion 
layer  and  space  charge  regions) .  Therefore  the  quantity  to  be 
equidistr ibuted  is  the  discretisation  error  of  Poisson’s  equation 
(i.e.  the  partial  derivatives  of  the  space  charge  times  the 
meshsizes) .  This  equidistr ibution  can  be  relaxed  inside  the 
pn-junction  layers  by  e.g.  simply  limiting  the  number  of 
gridpoints  there. 

* 

3.3  Linearisation  Schemes 

Each  discretisation  scheme  (Finite  -  Differences  or  Finite 
Elements)  will-  lead  to  a  large  sparse  system  of  nonlinear 
equations  to  be  solved.  The  theory  of  iterative  methods  to  solve 
these  equations  is  to  a  large  extent  independent  of  the  used 
discretisation  and  so  it  is  convenient  to  view  the  whole  problem 
as  solving  a  nonlinear  system  of  equations  iteratively  by  solving 
linear  systems.  The  existing  numerical  methods  can  essentially 
be  divided  into  two  classes:  The  first  approach,  a  block 
nonlinear  iteration  algorithm,  is  due  to  Gummel  /29/  and  uses  the 
fact  that  the  current  relations  are  linear  in  the  variables  u  and 
v  (as  defined  in  (2.11)).  In  these  variables  the  equations 
become  (again  we  use  the  scaling  of  /36/) : 
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(3.17) 


A2  *  d  i  v  grad  u  -  e  v  -  C 

a  a  m 

div  Jn  =  R,  -  eT*grad  u 

-k  -A  _U> 

div  J  =  -R,  J  *  -e  ^-grad  v 
P  P 


(3.18) 

(3.19) 


k  Jc4*l 

Gummels  approach  works  as  follows:  Given  (tpf u,v)  ^  is 
computed  by  solving: 

X2-div  grad  ^+1  =  e^+1-uk  -  e^+1  •vk  -  C  (3.20) 

k^l 

subject  to  the  appropriate  boundary  conditions.  Then  u 
k+1 

and  v  are  computed  from: 


div  J 


k+1  =  R(grad  q^+1,uk,vk) ,  Jk+1  =  ^grad  uk+1 


div  J^+1  -  -R(grad  uk , vk)  ,  J 

P  P 


k+1  ,  -q^+1. 


grad  v 


(3.21) 


(3.22) 
k+1 


together  with  the  boundary  conditions  for  u  and  v.  (3.21) 

k+1  k+1 

and  (3.22)  are  two  decoupled  linear  equations  for  u  and  v 
Poissons* s  equation  (3.20)  is  nonlinear  in  this  setting  and 

therefore  it  has  to  be  solved  iteratively  itself  in  each  step  by 

a  Newton  like  method.  Since  Newton's  method  is  an  inner 
iteration  within  the  overall  iteration  process  (3 . 20 ) - (3 . 22)  it 
may  not  be  necessary  to  let  this  inner  iteration  "fully  converge" 
/27/.  It  could  for  instance  be  considered  to  do  only  one  Newton 
step  for  each  iteration.  This  would  lead  to  the  linear  equation: 

A2 -div  grad  ^+1  =  (e^.u*  +  e-^-vk).(^+1-^)  + 

+  e^-uk  -  e"^-vk  -  C  (3.23) 

instead  of  (3.20).  The  advantage  of  Gummels's  method  is  obvious. 
(3.20 ) - (3.22)  can  be  solved  sequentially  which  decreases  the 
required  amount  of  storage  and  computing  time  drastically  for 
each  step.  However,  bad  convergence  properties  can  be  observed 
in  the  case  of  high  currents.  This  is  explained  by  viewing 
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(3*20)- (3.22)  as  iterating  the  map  M: (uk , vk) *(uk+^ , vk+1)  where 
the  evaluation  of  M  involves  the  solution  of  (3.20)*  Then  the 
norm  of  the  linearisation  of  M  (as  an  operator  acting  in  the 
appropriate  spaces)  at  the  fixpoint  M(u  ,v  )=(u  ,v  )  is 
proportional  to  the  current  densities  /42/. 

The  second  approach  to  the  solution  of  the  nonlinear 
equations  (2,1) -(2*5)  is  a  damped  modified  Newton  method.  To 
solve  the  general  equation  F(x)=0  one  computes  the  sequence  <xk> 
by: 

Mk*<fk  =  -F(xk),  xk+1  =  xk  +  tk-rfk  (3.24) 

For  the  usual  Newton  method  Mk  =  F ' (xk)  and  tk  =  1  holds. 

Bank  and  Rose  /4/  have  given  criteria  for  the  choice  of  the 

k 

damping  parameters  t  which  guarantee  global  convergence. 
Moreover  they  investigate  how  well  4  has  to  approximate  the 
classical  Newton  step  in  order  to  get  a  certain  rate  of 
convergence.  They  obtain  that  the  rate  of  convergence  is  p 
(l<p<2)  if: 

|Mk-<fk  +  F(xk)  |  =0(|F(xk)|p)  (3.25) 

holds  asymptotically  for  k  ♦  oo.  Alternatively  Bank  and  Rose  /3/ 
suggested  Mk  -  AkI  +  F* (xk)  where  Xk  is  proportional  to  |F(xk) |. 
Fran2  /24/  tested  this  method  with  good  success.  However,  he 

if 

additionally  chooses  damping  parameters  t  according  to  Deuflhara 
/15/,  /16/ . 

Since  this  approach  has  the  disadvantage  that  all  three 

equations  are  solved  simultaneously  -  and  therefore  the  storage 

requirements  are  fairly  large  -  we  suggest  a  Block-Newton-SOR 

T 

method  /24/.  Defining  F“(Fi,F2»F3)  Newton's  method  at  step  k 
is: 
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(3*26) 
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F3(^,nk,pk): 

Under  the  assumption  that  the  Jacobian  is  definite  one  can 
use  a  classical  block  iteration  scheme  (iteration  index  m)  for 
the  solution  of  the  k-th  Newton  step: 

(3.27) 
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Since  the  coefficient  matrix  of  (3,27)  is  block  lower 
triangular  one  can  decouple  the  elimination  process  into  three 
linear  systems  (3. 28) - (3. 30)  which  have  to  be  solved 
sequentially. 


(3.28) 


.  <fi$km+l  -  -Fi^/nkfpk)  -  ^nkm - 5J1* 


|£2k.  <fnkm+l  =  -F2(tJ*fnk,pk)  -  ^  1  _  X  rfpkm 


(3.29) 


^pkm+1  =  -F3(^,nk,pk)  -  <M®«- 


(3.30) 
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This  iteration  method  has  (like  Gummel's  method)  the 
advantage  that  the  equations  can  be  solved  sequentially.  To  end 
up  with  the  Block~Newton~SOR  method  one  has  to  resubstitute  the 
series  expansions  on  the  right  hand  side  of  (3, 28) - (3 , 30)  and  to 
introduce  a  relaxation  parameter  ufc 


\ 


?!ik 


•  ^atvHL 


•  Fi  f  n*^n^,pk+</pklT1) 


(3.31) 


_2  .  JntoH-1  _  ,  p2  ,  nk ,  p^p1™] 


(3.32) 


x— ^  .  J'pknH-l  -  (^W^i?^^/n^^n^IT^'^fp^) 

«P 


(3.33) 


This  method  converges  linearly  /48/.  However,  we  still  have 
to  perform” thorough  investigations  in  order  to  properly  judge  the 
convergence  properties. 


4.  Solution  of  Linear  Systems 

For  any  of  the  linearization  procedures  which  have  been 
outlined  in  the  last  chapter  a  large  sparse  linear  equation 
system  (4.1)  has  to  be  solved  repeatedly. 

A-x  =  b  (4.1) 

A  has  been  derived  by  linearizing  discretized  PDEs.  Hence  A 
has  only  five  to  nine  nonzero  entries  per  row  and  block  (the 
blocks  are  defined  in  (3.26));  A  is  very  sparse.  For  the 
solution  of  these  special  types  of  linear  systems  of  equations 
two  classes  of  methods,  can,  in  principle,  be  used:  direct 
methods  which  are  based  on  elimination  and  iterative  methods.  An 
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excellent  survey  on  that  subject  has  been  published  recently  by 
Duff  /18/.  Classical  Gaussian  elimination  is  not  feasible  for 
our  systems  of  equations  because  the  rank  of  A  in  (4.1)  is  very 
large  and  A  has  many  coefficients  which  are  zero.  Therefore, 
modifications  of  the  classical  Gaussian  elimination  algorithm 
have  to  be  introduced  to  account  for  the  zero  entries.  There 
exist  quite  a  few  activities  on  that  subject  (c.f.  /19/)  and 
powerful  algorithms  which  treat  the  nonzero  coefficients  only  are 
available  (so  called  sparse  matrix  codes) .  Another  serious 
drawback  of  direct  methods  lies  in  the  fact  that  the  upper 
triangular  matrix  which  is  created  by  the  elimination  process  has 
to  be  stored  for  back  substitution.  This  matrix  has  usually  more 
nonzero  entries  than  the  matrix  A.  Therefore,  memory  requirement 
of  direct  methods  is  substantial.  One  advantage  of  the  linear 
systems  obtained  from  the  discretised  semiconductor  equations  is 
that  no  pivoting  in  order  to  maintain  numerical  stability  is 
needed.  In  spite  of  all  drawbacks  of  direct  methods,  their  major 
advantage  is  high  accuracy  of  the  solution.  However,  we  feel 
that  for  the  semiconductor  problems  iterative  algorithms  are  to 
emphasize.  Nevertheless  we  and  many  others  have  observed 
difficulties  with  respect  to  the  convergence  speed  of  iterative 
methods,  so  that  the  direct  methods,  which  require  an  exactly 
predictable  amount  of  computer  resources,  will  always  stay  in 
consideration. 

The  fundamental  idea  of  relaxation  methods  (which  are  the 
best  established  iterative  methods)  is  the  splitting  of  the 
coefficient  matrix  A  (4.1)  into  three  matrices  D,  E,  F  (4.2). 

A  -  D  -  E  -  F  (4.2) 

D  denotes  the  diagonal  entries  of  A;  -E  denotes  a  lower 
triangular  matrix  which  consists  of  all  sub-diagonal  entries  of 
A;  and  -F  denotes  an  upper  triangular  matrix  which  consists  of 
all  super-diagonal  entries  of  A. 

With  an  arbitrary  non  singular  matrix  B  which  has  the  same 
rank  as  A  the  linear  system  (4.1)  can  be  rewritten  to  (4,3): 
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(4.3) 


B-x  +  (A-B) *  x  —  b 

One  obtains  an  iterative  scheme  by  setting: 
k+1  k 

B*x  -  b  -  (A-B) (4.4) 

k+1 

(4,4)  can  be  solved  for  x  : 

xk+1  =  (I-B_1.A)  "Xk  +  B^-h  (4.5) 

The  scheme  (4.5)  will  converge  if  condition  (4.6)  holds: 

*(I-B-1-A)  <1  (4.6) 

(4.6)  is  a  necessary  and  sufficient  condition  where  ? 

denotes  the  spectral  radius  /64/.  Any  relaxation  method  can  be 

derived  by  differently  choosing  the  matrix  B  from  the  splitting 
of  A  (4.2).  The  simplest  scheme,  the  point-Jacobi  method,  uses  D 
for  B.  Matrix  D  is  a  diagonal  matrix  and,  therefore,  is  easily 
invertible.  The  Gauss-Seidel  method  uses  D-E  for  B.  The  matrix 
D-E  is  a  lower  triangular  matrix.  Therefore  one  has  only  to 

perform  a  forward  substitution  process  for  its  inversion.  The 

successive  overrelaxation  method  (SOR)  uses  a  parameter  w  within 
the  range  ]0,2[.  The  iteration  matrix  B  is  defined: 

B  =  D/UI  -  E  •  (4.7) 

Since  B  is  again  a  lower  triangular  matrix,  its  inversion  is 
instantly  reduced  to  a  substitution. 

The  major  advantage  of  these  iterative  methods  lies  in  their 
simplicity.  They  are  very  easy  to  program  and  demand  only  low 
memory  requirement.  As  already  noted,  they  converge  if  condition 
(4.6)  holds.  However,  this  is  generally  difficult  to  prove.  A 
sufficient  condition  for  convergence  is  that  A  is  positive 
definite  (4.8)  which  is  the  normal  case  for  five-point-star 
discretized  PDEs. 

xT  •  A  •  x  >  0  foe  all  x/0  (4.8) 
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It  should  be  noted  again  here  that  the  current  relations  and 
continuity  equations  are  not  self  adjoint  if  (^n,p)  are  used  as 
variables  (see  (2.10),  (2.11)).  However,  the  transformation: 


(4.9) 


results  in  a  similarity  transformation  of  the  iteration  matrix  in 
(4.6).  Thus  the  spectral  radius  of  the  iteration  matrix  is  not 
influenced  and  the  same  convergence  properties  are  obtained  as  if 
the  system  had  been  discretized  in  its  self  adjoint  form  with 
CV,u,v)  as  variables. 

Some  point-iterative  schemes  can.  by  accelerated  quite 
remarkably  with  the  conjugate  gradient  method  or  the  Chebyshev 
method.  An  excellent  survey  on  these  topics  can  be  found  in 
/28/ . 

Various  activities  can  be  observed  for  the  development  of 
more  powerful  algorithms  with  the  advantages  of  iterative 
schemes.  One  of  the  best  known  algorithms  which  has  been 
established  in  semiconductor  device  analysis  is  Stone* s  strongly 
implicit  procedure  /58/.  Stone's  idea  was  to  modify  the  original 
coefficient  matrix  A  by  adding  a  matrix  N  (whose  norm  is  much 
smaller  than  the  norm  of  A)  so  that  a  factorization  of  (A+N) 
involves  less  computational  effort  than  the  standard 
decomposition  of  A,  Assuming  this  has  been  done,  the  development 
of  an  iterative  procedure  is  then  fairly  straightforward  because 
the  equation  can  be  written  as: 


(A+N) *x  =  (A+N) *x  +  (b-A*x) 


(4.10) 


which  suggests  the  iterative  procedure: 

(A+N) -xk+1  =  (A+N)*xk  +  (b-A-xk)  (4.11) 

When  the  right  hand  side  is  known  and  if  (A+N)  can  be 
factorized  easily,  (4.11)  gives  an  efficient  method  for  directly 
solving  for  x^+^.  Furthermore,  one  would  intuitively  expect  a 
rapid  rate  of  convergence  if  N  is  sufficiently  small  compared  to 
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A.  We  will  refrain  from  explaining  in  detail  Stone's  suggestion 
of  how  to  choose  the  perturbation  matrix  N  because  this  has  been 
done  thoroughly  in  many  publications  e.g.  /23/,  /55/,  /58/.  A 
major  disadvantage  of  Stone 's  method  is  that  it  is  only 
applicable  for  linear  systems  obtained  by  a  classical  Finite 
Difference  discretisation.  It  is  not  applicable  for  systems 
obtained  by  the  Finite  Boxes  approach  or  the  general  Finite 
Element  approach. 


There  exist  a  few  algorithms  which  are  similar  to  Stone's 
method  in  terms  of  underlying  ideas.  The  most  attractive  are  the 
method  of  Dupont  et  al.  /20/,  the  "alternating  direction 
implicit'  methods  e.g,  /6/,  /23/ ,  /66/  and  the  Fourier  methods 
/57/ #  /64/,  However,  most  of  these  sophisticated  algorithms  lack 
general  applicability. 


No  matter  which  iterative  method  is  used  one  has  to  deal 
w^th  the  question  of  an  appropriate  termination  (convergence) 
criterion.  Usually  (4.12)  is  applied  with  a  properly  chosen 
relative  accuracy  Zz 


|xk+1-xk|  <  €•  |xk+1 


(4.12) 


Since  increments  still  accumulate  when  (4.12)  is  already 
satisfied  we  suggest  to  use  (4.13)  instead  of  (4.12): 


lxk+1-xk 


<  €* |xk+1 | • (1-? (G) ) 


(4.13) 


?  (G)  can  be  estimated  as  |  xk+1-xk  | /|  xk-x 


k-1 


One  disadvantage  of  all  strongly  implicit  methods  and  also 
the  direct  methods  is  that  they  cannot  be  implemented  efficiently 
on  a  computer  with  a  pipe-line  architecture  (vector  processor) . 
Some  comments  on  that  subject  have  been  given  in  /18/. 
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5.  A  Glimpse  on  Results 


As  an  illustrative  example  a  relatively  'simple  structure,  a 

two  dimensional  diode,  is  chosen.  Fig.  4  shows  the  doping 

14  —  3 

profile  as  birds-eye-view  plot.  A  substrate  with  10  cm 
acceptor  concentration  and  an  exponentially  graded  n-region  with 
10  cm  maximum  doping  is  assumed.  The  initial  mesh  is 
automatically  generated  from  the  doping  profile  and  the  geometry 
definition.  The  simulation  domain  (device  geometry)  is  a  square 
of  100>f  times  100pm  size.  At  the  n-region  an  ohmic  contact  with 
length  20pm  is  assumed.  The  substrate  is  fully  contacted.  The 
initial  mesh  for  a  Finite  Boxes  program  is  shown  in  Fig,  5  and 
for  a  Finite  Element  program  in  Fig.  6,  The  point  allocation  is 
identical  for  both  representations.  The  grid  consists  of  121 
points  versus  173  when  all  gridlines  are  extended  throughout  the 
device.  This  clearly  demonstrates  the  advantage  of  the  Finite 
Boxes  approach.  In  Finite  Element  representation  one  has  to  deal 
with  80  rectangular  elements  and  17  transition  elements  which 
consist  of  51  triangles. 

Fig.  7  shows  the  final  grid  for  an  operating  condition  of 
0.7V  forward  bias  in  Finite  Boxes  representation.  This  mesh  is 
obtained  after  several  adaption  processes  using  the  criteria 
given  in  chapter  3.  It  consists  of  270  points  (versus  430  for 
the  classical  approach) *  In  Fig.  8  the  potential  distribution  is 
drawn.  From  this  plot  and  even  better  from  the  electron  density 
(Fig.  9)  one  nicely  can  deduce  the  effects  of  high  injection. 
E.g.  the  substrate  is  flooded  with  carriers.  Fig,  10  shows  the 
magnitude  of  the  electron  current  density.  The  peak  value  is 
about  180  A/cm^,  The  sharply  pronounced  peak  which  exists  at  the 
transition  of  the  Dirichlet  boundary  condition  to  the  Neumann 
boundary  condition  corresponds  to  a  singularity  of  the  carrier 
densities.  Physically  interpreted  this  effect  is  well  known  as 
contact-corner-current-crowding. 

Fig.  11  shows  the  final  grid  for  an  operating  condition  of 
-20V  (reverse)  bias  in  Finite  Element  representation.  This  mesh 
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consists  of  363  points  (625  for  classical  Finite  Differences) 
which  correspond  to  277  rectangular  elements  and  41  transition 
elements  (123  triangles) .  The  electron  density  for  this 
operating  point  is  given  in  Fig.  12,  One  nicely  observes  the 
depletion  region  and  the  typical  shape  of  the  drop  of  the 
electron  density  in  that  region  owing  to  thermal  generation.  In 
Fig.  13  the  magnitude  of  the  electron  current  density  is  drawn. 
The  singularity'  at  the  contact  corner  is,  although  it  still 
exists,  not  so  pronounced.  Note  that  there  are  about  seven 
orders  of  magnitude  difference  in  the  peak  value  compared  to  Fig. 
10. 

6.  Conclusion 

In  this  paper  we  have  presented  an  analysis  of  the  steady 
state  semiconductor  equations  and  the  impact  of  this  analysis  on 
the  design  of  device  simulation  programs.  3y  appropriate  scaling 
we  have  transformed  the  semiconductor  equations  into  a  singularly 
perturbed  elliptic  system  with  nonsmooth  data.  Information 
obtained  from  the  singular  perturbation  analysis  has  been  used  to 
investigate  stability  and  convergence  of  discretisation  schemes 
with  particular  emphasis  on  the  adaptive  construction  of 
efficient  grids.  We  have  reviewed  algorithms  for  the  solution  of 
nonlinear  and  linear  systems  of  the  discretized  semiconductor 
equations.  An  example  has  demonstrated  the  power  and  flexibility 
a  device  simulation  program  can  achieve  when  using  the 
information  we  have  presented  for  program  design. 
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Fig.  7  Final  Mesh  for  0.7V  Forward  Bias  (Finite  Boxes) 
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Fig.  Q  Potential  distribution  (0.7V)  [V]  ( 1 in ) 


391 


.S  >\^ 


Fig.  13  Electron  Current  Density  (-20V) 
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NON-LINEAR  MULTIDIMENSIONAL  MODELS,  I£AST  SQUARES 
AND  REDUCED  RESIDUALS 

Alvars  Celmi^s 

Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,- Maryland  21005 

ABSTRACT.  This  paper  addresses  the  treatment  of  such  model  fitting 
problems  that  include  an  analysis  of  the  residuals,  for  instance  to  determine 
heteroscedasticity*  In  cases  with  multidimensional  observations  the  residuals 
are  vectors  having  model  induced  correlations  between  components.  It  is  shown 
that  one  can  eliminate  such  correlations  and  also  reduce  the  number  of 
residual  components  by  introducing  a  concept  of  reduced  residuals.  The 
application  of  the  new  concept  is  illustrated  by  an  investigation  of 
heteroscedasticity  of  vapor  pressure  measurements. 

1.  INTRODUCTION.  The  residuals  of  model  fitting  problems  are  valuable 
sources  of  information  about  observational  errors,  model  adequateness  and  the 
relative  importance  of  observations.  The  information  is  typically  obtained  by 
analyzing  the  distribution  and  other  properties  of  the  residuals.  If  the 
model  equation  is  formulated  in  terms  of  one  scalar  dependent  variable,  and 
the  independent  variables  are  assumed  to  be  error  free  then  the  analysis 
involves  only  standard  procedures  for  the  distribution  of  a  scalar  quantity. 
If,  however,  the  adjustable  observations  are  not  scalars,  for  instance,  if  the 
independent  variables  are  subject  to  errors,  then  also  the  residuals  are 
vectors  and  the  usual  methods  of  residual  investigation  must  be  modified. 
This  paper  addresses  such  modifications  in  cases  of  general  non-linear  models, 
which  are  assumed  to  be  formulated  by  systems  of  implicit  equations,  and 
fitted  by  least  squares.  We  show  that  the  n-dimensional  residuals  of  the 
observations  are  not  necessarily  appropriate  for  an  analysis  of  distribution, 
heteroscedasticity  and  other  properties,  because  the  constraints  induce 
correlations  between  residual  components.  We,  therefore,  introduce  the 
concept  of  reduced  residuals.  These  residuals  have  a  dimension  that  is  less 
or  equal  to  that  of  the  observables,  and  their  components  are  dimensionless 
and  free  from  model  induced  correlations.  In  standard  least  squares  problems, 
where  the  model  equations  are  scalar,  also  the  reduced  residuals  are  scalars 
and  standard  techniques  can  be  used  for  the  investigation  of  their 
distribution.  One  also  can  show  that  no  information  is  lost  by  analysing  the 
reduced  residuals  instead  of  the  original  residuals,  that  is,  the  reduced 
residuals  contain  all  the  information  about  the  real  errors  that  can  be 
extracted  from  the  model  fitting.  An  example  of  the  use  of  reduced  residuals 
is  presented  by  discussing  the  investigation  of  heteroscedasticity  of  vapor 
pressure  measurements. 

The  main  definitions  and  principal  results  are  presented  in  Section  2, 
and  an  example  of  applications  is  given  In  Section  3.  The  background  of  the 
theory  is  outlined  in  the  Appendix,  where  we  present  detailed  derivations  of 
the  formulas  of  Section  2. 
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2.  REDUCED  RESIDUALS  IN  GENERAL  MODEL  FITTING  PROBLEMS.  We  give  in  this 
section  the  definition  of  reduced  residuals  and  present  some  of  their  more 
important  properties.  Derivations  of  the  formulas  and  detailed  discussions 
are  provided  in  the  Appendix. 

We  consider  least  squares  model  fitting  problems  that  can  be  formulated 
as  the  following  constrained  minimization  task: 

s  T  -1 

minimize  W  -  £  c,  R  c.,  (2.1a) 

i=l  1  1  1 

subject  to  +  ci>  t)  =  0>  i  =  .  (2.1b) 


The  are  s  observed  points  in  a  n-dimensional  space  of  observables,  that  is, 
each  observation  Xi  is  a  n-dimensional  vector;  the  are  the  corresponding 
residual  vectors;  t  is  a  p-dimensional  vector  of  model  parameters;  and  R^  are 
estimated  (n  x  n) -dimensional  variance-covariance  matrices  of  the  observations 
X^  Each  constraint  equation  (2.1b)  is  a  set  of  r  equations,  that  is,  the 
model  functions  Fi(C,t)  are  r-dimensional  vector  functions  of  n  +  p 
variables.  The  problem  (2.1)  includes  as  a  special  case  elementary  weighted 
least  squares  problems  for  which  r  -  1,  n  -  1  and  the  functions  F^(£,t)  are 
linear  with  respect  to  The  unknowns  of  the  problem  (2.1)  are  the  residual 

vectors  ^  and  the  model  parameter  vector  t.  The  given  input  consists  of  the 
observations  X^,  their  estimated  variances  and  covariances  R^,  and  postulated 
functional  relationships  F^(5,t)  =  0. 

The  least  squares  residuals  c^  are  n-dimensional  vectors  but  they  do  not 
necessarily  span  a  n-dimensional  space.  For  instance,  if  one  fits  a  straight 
line  to  observations  in  a  plane,  whereby  both  coordinates  are  adjusted  and  the 
are  all  equal,  then  all  residual  vectors  c^  are  parallel  to  each  other. 
That  is,  they  are  elements  of  a  one-dimensional  subspace  (line)  of  the  two- 
dimensional  space  of  observables.  In  this  example  all  information  about 
distribution  properties  of  the  residuals  is  contained  in  the  algebraic  lengths 
of  the  residuals,  and  the  analysis  of  the  residuals  can  be  reduced  to  the 
analysis  of  the  distribution  of  a  scalar. 

In  order  to  effectively  handle  the  described  situation  and  similar  more 
general  problems  we  define  reduced  residuals  by  the  equation 


(F  R  FT  )_i^2F  c 
^  xi  i  xi;  xi  i 


(2.2) 


where  F  .  is  the  Jacobian  matrix 


3F  (X  +  c  ,  t) 

F  =  - 1 - i - - - 

xi  3X. 

r 


(2.3) 
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The  reduced  residuals  a^  have  the  same  number  of  components  as  F^,  and  the 
components  are  dimensionless*  In  the  planar  curve  fitting  problem  described 
above,  the  reduced  residuals  a^  are  dimensionless  scalars  with  an  absolute 
value  equal  to  a  norm  of  c^,  and  positive  or  negative,  depending  on  which  side 
of  the  line  Fi  =  0  the  observation  is  located. 

Proper  norms  of  the  residuals  c^  are  in  least  squares  problems  the 
elliptic  norms 


->1/2 
Ri  v 


(2.4) 


because  the  objective  function  W  in  (2.1a)  then  is  the  sum  of  the  squares  of 
the  norms  ||c^||  .  For  later  reference  we  also  define  an  inner  product 
associated  with  each  observation  ^  by 

(ci*  bi^  =  ci  Ri^bi  *  (2,5) 

We  show  in  the  Appendix  that  the  Euclidean  norm  of  the  r-dimensional 
reduced  residual  a^  is  equal  to  the  elliptic  norm  (2*4)  of  the  n-dimensional 
residual  c^ : 


*;V1/2 


(2.6) 


In  a  standard  least  squares  problem  the  are  scalar  model  functions. 
In  that  case  the  definition  (2.2)  of  the  reduced  residuals  can  be  replaced  by 


ai  =  (c*  ^  sgn  (Fxi  ct).  (2.7) 

In  elementary  least  squares  problems  the  F^  are  assumed  to  be  linear  with 
respect  to  the  observable.  In  that  case  Eq.  (2.7)  futher  simplifies  to 


a.^  =  c^/  e^»  (2.8) 

where  e^  is  the  estimated  standard  error  of  the  scalar  observation  X^,  Hence 
in  elementary  least  squares  problems  the  reduced  residuals  are  identical  to 
the  familiar  weighted  residuals* 
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The  concept  of  reduced  residuals  can  be  geometrically  interpreted  as 
follows*  The  fitted  r-dimensional  model  equation  Fi(?,t)  =  0  defines  in  the 
n-dimensional  £ -space  of  observables  a  (n  -  r) -dimensional  hypersurface,  and 
the  corrected  observation  X^  +  c^  is  a  point  of  that  surface.  The  least 
squares  residual  c^  is  a  n-dimensional  vector  orthogonal  to  the  hyper surf ace 
=  0.  (Orthogonal  in  the  sense  of  the  inner  product  (2.5).)  That  is,  all 
possible  least  squares  residuals  corresponding  to  the  hypersurface  point  + 
c^  define  a  hyperplane  orthogonal  to  the  surface  F^  «  0.  The  hyperplane  is  a 
r-dimensional  linear  subspace  of  the  n-dimensional  £ -space,  and  the  reduced 
residuals  a^  are  elements  of  that  subspace. 

The  advantages  of  working  with  the  a^  instead  of  the  are  as  follows. 
First,  the  a^  have  in  general  less  components  than  the  c^.  Second,  the 
components  of  the  a^  are  dimensionless  whereas  the  components  of  the  have 
generally  different  physical  dimensions.  Third,  there  are  no  restrictions  on 
the  components  of  the  a^,  whereas  the  components  of  the  c^  are  restricted  by 
the  condition  of  orthogonality  to  -  0.  The  latter  condition  induces 
apparent  correlations  between  components  of  the  c^;  the  components  of  the 
are  free  from  such  correlations.  If  correlations  between  components  of  the  ai 
are  detected  then  they  generally  indicate  correlations  between  observations  as 
we  will  show  next. 

A  firs£  order  relation  between  the  reduced  residual  a^  and  the  unknown 
real  error  c^  of  the  observation  X^  is  (see  Appendix,  Eq.  (A. 23)) 


(Fxl  Rt  F.T, )  "1/2  F. 


XI 


Xici 


+  (F. 


xi  Ri  Fxi}  F 


ti 


(t  -  t),  (2.9) 


where  F^  is  the  Jacobian  matrix 


F 


ti 


3  yxi +  cr 


(2.10) 


and  t  is  the  unknown  true  value  of^  the  model  parameter.  The  relation  (2.9) 
neglects  terms  of  higher  order  in  ^  -  c^  and  t  -  t.  The  second  term  on  the 
right  hand  side  of  Eq.  (2.9)  is  a  function  that  in  general  slowly  varies  along 
the  hypersurf^ce  F^  =  0.  It  can  be  interpreted  as  a  distance  between  the  true 
surface  F^(^,t)  -  0  and  the  least  squares  fit  F^(£,t)  -  0.  The  first  tejm  on 
the  right  hand  side  is  the  projection  of  the  true  observational  error  ^  on 
the  subspace  orthogonal  to  F^  =  0.  Therefore,  any  local  scatter  of  the  true 
observational  errors  will  usually  manifest  itself  as  a  local  scatter  of  the 
reduced  residuals  a^.  In  particular,  if  the  c  are  componentwise  normally 
distributed  then  so  are  their  projections  and,  except  for  a  bias  caused  by  the 
slowly  varying  second  term  in  Eq.  (2.9),  the  reduced  residuals  a^. 
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Figure  1  presents  a  geometrical  interpretation  for  the  case  where  Fi  =  0 
defines  a  curve  in  the  plane  of  observations.  The  least  squares  residual  c^ 
is  orthogonal  in  the  sense  of  the  inner  product  (2.5)  to  the  fitted  curve,  and 
it  defines  the  subspace  (line)  for  a^  The  latter  is  a  measure  for  the 
algebraic  length  of  ^  along  the  normal.  According  to  Eq.  (2.9)  ai  is  the 
sum  of  the  projection  of  the  true  error  c.  on  the  normal  and  a  term 

representing  the  distance  between  the  true  anJ  the  fitted  curve,  measured 
along  the  same  normal. 

3.  APPLICATION  EXAMPLE.  An  example  for  the  use  of  reduced  residuals  is 
the  following  investigation  of  heteroscedasticity  of  vapor  pressure 
measurements.  The  measurements  consist  of  a  series  of  pressure-temperature 
correspondences  to  which  one  fits  the  so-called  Antoine  equation,  viz., 

F(p , T;  A, B,C)  *  lg(p/pR)  -  A  +  B/[(T  -  273.15)  +  C]  -  0  (3.1) 

where  p  (Pa)  is  pressure,  T  (K)  is  temperature,  pR  (Pa)  is  a  reference 

pressure  (usually  pR  =  1  torr  =  7.50064 -10-5  Pa),  and  A,  B  and  C  are  model 
parameters  [lj.  Because  the  model  function  (3.1)  is  scalar,  this  is  a 

standard  least  squares  problem  that  can  be  solved  numerically  with  available 
utility  routines  [2J  provided  that  estimates  are  available  of  the  accuracies 
of  the  observations  of  T  and  p.  More  typical  for  these  measurements  is  a 
situation  where  the  information  about  data  accuracies  is  incomplete:  one  can 
assume  that  the  temperature  measurements  are  all  made  with  the  same  standard 
error,  but  the  pressure  standard  errors  likely  do  depend  on  the  pressure. 
Then  the  problem  is  to  find  from  the  same  data  set  estimates  for  the  model 
parameters  A,  B  and  C  as  well  as  estimates  for  other  parameters  describing 
the  data  accuracy,  particularly  the  dependence  of  the  pressure  accuracy  on  the 
pressure.  We  assume  that  all  observations  are  independent  and  postulate  the 
following  models  for  the  standard  errors  of  pressure  and  temperature 

measurements : 

epi/pR  =  eo  (1  +  ei(Pi  +  cpi)/pR) 

(3.2) 

eTi  =  eoS2 

In  Eq.  (3.2)  eQ  is  the  standard  error  of  weight  one  which  is  computed  after 
the  adjustment  by 

eQ  =  [W/(s  -  3)]  1/'2,  (3.3) 
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Figure  1.  Reduced  Residual  in  Planar  Curve  Fitting. 

The  reduced  residual  a.  is  a  scalar  measuring  the 
algebraic  length  of  the  residual  c^. 
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and  cpi  is  the  least  squares  residual  of  the  observed  pressure  The  error 

model  (3.2)  contains  two  parameters,  0-  and  0^,  For  9^  it  represents  a 

constant  pressure  error  assumption,  and  for  large  0  it  represents  a  constant 
relative  pressure  error.  The  second  error  model  parameter  0^  has  the 
dimension  of  temperature  and  it  permits  one  to  model  the  relative  significance 
of  pressure  and  temperature  observations.  If  -  0  then  the  temperature 
observations  are  assumed  to  be  error  free. 

We  determine  the  error  model  parameters  by  minimizing  an  objective 
function  S  which  we  define  in  terms  of  the  reduced  residuals  as  follows 

s  ■ 1  +iK  qi ln  v  <3-4a) 


where 


(3.4b) 


The  use  of  the  negative  entropy  function  S  as  an  objective  function  was 
suggested  by  Nielsen  [3J.  His  definition  of  the  q.^  in  Eq.  (3.4b)  was, 
however,  in  terms  of  weighted  residuals  instead  of  reduced  residuals,  because 
he  was  only  treating  elementary  weighted  least  squares  problems,  Nielsen 
chose  S  as  a  measure  for  the  optimality  of  the  error  model  parameters  because 
of  the  following  properties  of  the  function: 


(a)  S  is  maximum  and  equals  one  if  all  but  one  of  the  qi  are  zero. 

(b)  S  is  minimum  and  equals  zero  of  all  q^  =  1/s. 

(c)  Any  averaging  of  the  q.^  reduces  S,  that  is,  if 


0  <  a^_.  <1,  ^  ^  ~  aij  i  -  1,2,...  ,s 


and 


’i  aia  V 


i-1 


then 


S  (q)  <  S  (q)  . 
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Hence  a  smaller  S  generally  means  smaller  differences  between  the  q^.  Because 
of  the  definition  (3.4b)  and  the  relation  (2*6)  this  in  turn  means  that  a 
minimization  of  S  tends  to  equalize  the  | |c^| |  •  For  given  values  of  0^  and 

0 2  one  computes  S  by  first  carrying  out  a  least  squares  adjustment  with  the 
model  function  (3.1)  and  the  error  estimates  (3.2).  The  adjustment  produces 
least  squares  values  of  model  parameters  A,  B  and  C  and  a  set  of  residual 
vectors  c^.  These  results  provide  via  Eq.  (2.7)  the  reduced  residuals  from 
which  the  value  of  S  can  be  computed  by  Eq.  (3.4). 

We  notice  in  passing  that  the  choice  of  S  as  an  objective  function  is 
arbitrary  and  one  may  instead  use  other  functions  with  similar  properties 
[4J.  In  limited  numerical  experiments  we  indeed  found  little  difference 
between  corresponding  results  with  different  objective  functions.  The  results 
quoted  in  this  paper  are  for  the  objective  function  S  as  defined  by  Eq.  (3.4). 


The  optimality  of  the  error  model  parameters  can  also  be  defined  and 
tested  by  other  means  than  an  objective  function.  For  instance,  one  can  in 
the  present  example  investigate  the  distribution  of  the  reduced  residuals  and 
compare  it  to  a  normal  distribution  if  the  real  errors  are  known  to  be 

normally  distributed  with  a  heteroscedasticity  parameter  0^  in  the  form  of  Eq. 

(3.2).  Then  the  optimal  values  of  0  ^  and  0^  would  be  those  for  which  the 

reduced  residual  distribution  is  closest  to  normal  according  to  some 

appropriate  criterium.  As  we  shall  see  in  our  examples,  such  an  approach  is 
practically  equivalent  to  the  minimization  of  S. 

Explicit  formulas  for  the  reduced  residuals  and  other  quantities  in  the 
case  of  vapor  pressure  measurements  are  as  follows.  The  observation  vectors 

xi are 


xi 


(3.5) 


the  corresponding  residuals  are 


the  estimated  variance-covariance  matrices  are 


[i  +  +  c„,  )/pj2 


R.  = 
1 


V'- 


(3.6) 


0 


(3.7) 


and  the  reduced  residuals  are 


h  =  rc— 


Cj>i_/pR 


Tiv  2  t 1/2 


6l(pi  +  Cpi)/pR 


Y  +  (irV  •) 


s®n  (FpiCpi  +  FTiCTl^  (3-8) 


where  Fp^  and  FTi  are  the  partial  derivatives  of  the  model  function  (3.1)  with 
respect  to  p  and  T,  evaluated  at  the  adjusted  observations.  Hie  constraint 
equations  are  obtained  from  Eq,  (3*1)  by  setting 


F(pt  +  cpi,  Ti  +  cT±;  A,  B,  C)  =  0  ,  i  -  l,...,s. 


(3.9) 


Next  we  present  a  numerical  example  with  simulated  data  and  known  error 
distribution.  The  data  were  obtained  by  choking  a  set  A,  B,  C  of  Antoine 
parameters,  calculating  for  s  =  40  equidistant  values  the  corresponding 
from  Eq.  (3.1),  and  subtracting  from  the  p^  and  random  errors  c  i  and 
c  with  known  normal  distributions.  The  simulated  observations  thus  have  the 
values 


(3.10) 


Figure  2  shows  a  typical  s^  gf  simulated  data.  The  Antoine  parameters  that 
were  used  to  ^ calculate  p.(T.)  and  the  error  model  parameters  for  the 
calculation  of  c  .  and  c  are  listed  in  Table  1.  The  table  also  contains  the 
parameters  of  tJe1  fitted"  curve  shown  in  Figure  2,  and  of  the  error  ellipses. 
The  curve  and  the  error  ellipses  correspond  to  an  optimal  value  of  9^  that 
minimizes  the  objective  function  S  for  this  data  set.  The  second  error  model 
parameter  9^  was  chosen  such  that  e^  -  0^2  -  0.1  K,  that  is,  the  temperature 
standard  error  e^  was  preset  to  the  exact  value.  The  least  squares  fitting 
was  done  using  the  utility  program  COLSAC  [2].  The  confidence  limits  for  the 
fitting  curve  were  computed  by  solving  the  Antoine  Eq.  (3.1)  for  p  and 
applying  the  linearized  law  of  variance  propagation  to  the  function 
p(T;A,B,C),  that  is,  by 


e 

P 


[  3p(T;A,B,C) 

3  (A,B,C)  VABC 


3p(T;A,B,C)x  T^/2 
3(A,B,C)  ■*  1 


(3.11) 
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TEMPERATURE 


Simulated  Data  with  Fitted  Curve. 

Error  assumptions  for  the  fitting  are 
ePi/pR  =  i-19  +  °-°55  (Pi  +  cpi)/pR. 
e11  "  °‘1  K 

The  confidence  limits  and  error  ellipses  correspond 
to  3.4  standard  errors.  Pressure  is  shown  in  torr 
and  temperature  is  shown  °C. 


where  3p/3(A,B,C)  is  the  Jacobian  matrix  of  the  function  p(T;A,B,C) 

(the  gradient  of  p  in  the  A»B,Ospace) ,  and  VABC  is  the  estimated  variance- 
covariance  matrix  of  the  parameters  A,  B  and  C.  The  matrix  VABC  is  defined  in 
terms  of  the  standard  errors  of  A,  B  and  C,  and  of  the  corresponding 
correlation  matrix  CAfiC  (all  given  in  Table  1)  by 

VABC  =  DABC  CABC  DABC  »  (3.12) 

where  DABC  is  a  diagonal  matrix  with  the  standard  errors  in  the  diagonal. 
About  the  computation  of  VAB£  (and  C^g^)  from  the  least  squares  adjustment  see 
reference  [5].  The  matrices  are  part  of  the  output  of  the  utility  program 
COLSAC. 

The  dependence  of  the  objective  function  S  on  the  pressure  error 

parameter  0^  is  illustrated  in  Figure  3  for  a  fixed  temperature  standard  error 

eT  =  0.1  K.  The  shape  of  the  curve  is  typical  for  sample  problems  with  a 

sufficiently  large  number  of  data  (s  >  40)  and  a  reasonable  <  1.0  K.  Also 

the  numerical  results  are  only  little  influenced  by  the  preset  value  of  e^,  as 

shown  in  Table  1.  We  conclude  from  these  experiments  that  the  pressure  error 

heteroscedasticity  (the  parameters  e  and  e  0,)  can  be  reasonably  well 

O  O  X 

retrieved  if  s  )  40,  but  that  the  temperature  standard  error  eT  practically 
cannot  be  retrieved.  If  the  number  of  data  points  is  too  small  (s  <  10)  then 
one  observes  large  variations  of  the  optimal  0 ^values  between  different  sets 
of  random  input  errors.  If  the  temperature  standard  error  is  preset  to  a  high 
value  (eT  >  1.0  K)  then  the  least  squares  adjustment  produces  a  solution  were 
practically  only  the  temperature  observations  are  adjusted  and,  therefore,  the 
pressure  error  parameters  cannot  be  retrieved.  The  difference  between  input 
pressure  error  parameters  and  retrieved  parameters  shown  in  Table  1  is  typical 
for  the  given  size  of  the  problem.  The  differences  and  case  to  case 
variations  decrease  if  the  number  of  observations  Increases. 

The  distribution  of  the  reduced  residuals  is  illustrated  by  Figures  4 
through  6.  The  figures  show  the  normal  distribution  compared  to  the 
cumulative  residual  distribution  in  a  case  described  in  Table  1  and  for 
different  values  of  0^.  It  is  obvious  that  smaller  values  of  the  objective 
function  S  correspond  to  reduced  residual  distributions  that  are  closer  to 
normal.  For  comparison  we  show  in  Figure  7  the  cumulative  distribution  of  the 
reduced  true  input  errors  for  the  same  example.  One  observes  that  the  optimal 
distribution  of  the  reduced  least  squares  residuals  (Figure  5)  is  closer  to 
normal  than  the  distribution  of  the  corresponding  reduced  true  input  errors, 
drawn  from  a  random  number  generator. 

The  next  example  is  a  case  with  real  data  taken  from  reference  [1],  The 
data  are  shown  in  Figure  8  and  numerical  results  are  given  in  Table  2.  The 
dependence  of  the  objective  function  S  on  the  pressure  error  parameter  9^  Is 
illustrated  by  Figures  9  and  10.  In  this  case  S(9)  has  no  minimum  for  a 
finite  9  .  Therefore,  the  optimal  choice  is  a  constant  relative  pressure 
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OBJECTIVE  FUNCTION 


PRESSURE  ERROR  PARAMETER  THETA-1 


Figure  3.  Objective  Function  S  for  Simulated  Data. 

The  temperature  standard  error  e  =  e  1  is 
.set  to  0.1  K.  ° 
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tribution  Cumulative  Distribution 


-4*S  ~2*S  0*S  2*S  4*S 

Standard  Deviation 

Figure  4.  Comulative  Distribution  of  Reduced  Residuals  for  Simulated  Data 
and  Small  9^-10 

The  temperature  standard  error  is  set  to  eT  =  0.1  K. 


-4*S  -2«S  0*S  2«S  4 • S 

Standard  Deviation 

Figure  5.  Cumulative  Distribution  of  Reduced  Residuals  for  Simulated 
Data  and  Optimal  =  0.049. 

The  temperature  standard  error  is  set  to  =  0.1  K. 
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Standard  Deviation 


Figure  6.  Cumulative  Distribution  of  Reduced  Residuals  for  Simulated  Data 

4 

and  Large  6^  ■  10  . 

The  temperature  standard  error  is  set  to  =  0.1  K. 


Figure  7.  Cumulative  Distribution  of  Reduced  True  Simulated  Errors. 
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Data  and  Fitted  Curve  for  1-Tetradecanol 
Error  assumptions  Curve  for  fitting  are 

epi/PR  '  °-0294  (Pi  +  Cpi)/pR 


29.  tO 


The  confidence  limits  and  error  ellipses  correspond 
to  11.9  standard  errors.  Pressure  is  shown  in  torr 
and  temperature  is  shown  in  °C. 


PRESSURE  ERROR  PARAMETER  THETA-1 


Figure  9.  Objective  Function  S  for  1-Tetradecanol  Assuming  Zero 
Temperature  Error. 


PRESSURE  ERROR  PARAMETER  THETA-1 


Figure  10.  Objective  Function  S  for  1-Tetradecanol  Assuming 
Temperature  Error  em  =  0.1  K. 


standard  error  (corresponding  to  an  infinite  0.),  as  indicated  in  Table  2.  A 
S(0)  without  a  minimum  seems  to  be  typical  if  the  number  s  of  data  points  is 
small. 


The  difference  between  the  present  results  and  those  by  Kemme  and  Kreps 
is  between  one  and  two  standard  errors  of  the  Antoine  parameters.  It  is  not 
clear  from  reference  [1]  how  the  Antoine  parameters  were  calculated,  but  the 
reported  values  are  between  those  for  constant  ep  and  for  ep  proportional  to 
p.  We  obtained  almost  the  same  Antoine  constants  as  Kemme  and  Kreps  by 
assuming  eT  =  0  and 

epi/pR  -  0.139  +  0.139  (p±  +  V'PR- 

The  corresponding  S  was  0.4663  indicating  that  this  is  not  an  optimal  choice 
for  the  error  models. 


Table  1.  Result  from  a  Simulated  Experiment 


Input 

Retrieved 

eo  92 

0.1  K 

(0.0  K) 

(0.1  K) 

(1.0  K) 

eo 

1.0 

1.20 

1.20 

1.11 

■H 

CD 

O 

(U 

0.05 

0.056 

0.055 

0.051 

S 

— 

0.232  944 

0.232  945 

0.233  230 

A 

7.0 

7.567  +  0,629 

7.567  +  0.629 

7.570  +  0.631 

B 

1900  K 

2406  +  551 

2406  +  551 

2408  +  552 

C 

130  K 

181.39  +  49.88 

181.40  +  49.88 

181.51  +  49.88 

CAB 

— 

0.998  909  68 

0.998  909  86 

0.998  927  17 

cAC 

— 

0.995  506  28 

0.995  506  93 

0.995  571  48 

CBC 

— 

0.998  824  72 

0.998  824  86 

0.998  839  41 

®  °AC  an^  cBC  are  correlation  coefficients  of  the  Antoine 

parameters  A,  B  and  (7.  The  temperature  error  estimate  e ^  was  preset  as 
indicated . 
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Table  2.  Adjustment  Results  for  1-Tetradecanol 


Present 

Analysis 

Kemme 

and 

Kreps [ 1 ] 

eT  =  0,1  K 

eT  =  0 

eT  0 

Optimal 

ypR 

0.0294  p/pR 

0.0294  p/pR 

— 

s 

0.2974 

0.2973 

— 

A 

6.2284  +  0.1846 

6.2251  +  0.1851 

6.4840 

B  (K) 

1250.2  +  106.3 

1248  +  106.7 

1412.907 

C  (K) 

76.23  +  12.02 

76.01  +  12.06 

95.368 

CAB 

0.997  527  0 

0.997  525  0 

— 

cAC 

0.991  019  0 

0.991  025  3 

— 

cBC 

0.997  877  0 

0.997  882  1 

— 

The  aABj  oaq,  and  are  correlation  coefficients  between  the  Antoine 
parameters  A ,  5,  and  C .  The  temperature  range  of  observations  is  between 

425,15  and  569,15  K  (152  and  296°C)  . 

4.  Summary  and  Conclusion.  Multidimensional  residuals  arise  in  model 
fitting  problems  when  more  than  one  component  of  the  observables  is  subject  to 
adjustment*  Because  the  corrected  observations  must  satisfy  constraints 
representing  the  model  and  because  such  constraints  effectively  reduce  the 
degrees  of  freedom  for  the  corrections,  the  components  of  multidimenional 
residual  vectors  typically  are  strongly  correlated.  These  correlations  must 
be  eliminated  or  otherwise  taken  into  account  when  the  residuals  are  analyzed, 
so  that  interpretation  errors  can  be  avoided*  We  suggest  in  this  paper  to 
eliminate  the  model  induced  correlations  by  using  reduced  residuals.  The 
latter  generally  have  less  components  than  the  original  residuals,  and  the 
components  are  dimensionless.  The  reduction  of  the  number  of  components 
matches  the  loss  of  degrees  of  freedom,  so  that  there  are  no  correlations 
between  components  of  reduced  residuals,  except  when  such  correlations  are 
present  in  the  observations.  In  standard  least  squares  problems  (curve 
fitting  in  a  plane,  surface  fitting  in  three  dimensions,  etc.)  the  reduced 
residuals  are  scalars  with  an  absolute  value  equal  to  a  norm  of  the  residual 
and  a  positive  or  negative  sign,  depending  on  the  location  of  the  observation 
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with  respect  to  the  fitted  structure.  Hence,  by  introducing  reduced  residuals 
one  can  use  in  standard  least  squares  problems  well  known  techniques  for 
statistical  investigation  of  scalar  quantities.  As  an  example  for  such 
application  we  presented  the  investigation  of  heteroscedasticity  of  vapor 
pressure  measurements.  In  elementary  least  squares  problems  the  reduced 
residuals  are  identical  to  the  usual  weighted  residuals. 

In  conclusion,  the  introduction  of  reduced  residuals  greatly  simplifies 
the  investigation  of  residuals  arising  in  general  least  squares  problems.  The 
concept  has  a  simple  geometric  interpretation  and  the  routine  calculation  of 
reduced  residuals  easily  can  be  included  in  least  squares  utility  routines. 
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appendix 

PROPERTIES  OF  REDUCED  RESIDUALS 


We  provide  in  this  appendix  proofs  and  derivations  of  the  formulas  quoted 
in  Section  2* 


In  the  model  equations 

FiU,  t)  =  0,  i  -  1,...,  s  (A.  1) 


of  the  minimization  problem  (2*1)  the  r-dimensional  vector  functions  F^S.t) 
are  assumed  to  be  componentwise  twice  differentiable  with  respect  to  all  its  n 
+  p  arguments*  We  also  assume  that 

rank  3F/3£  =  r  (A* 2) 

in  a  neighborhood  of  the  least  squares  solution.  The  condition  (A. 2)  insures 
that  the  r  equations  in  each  set  in  Eq.  (A.l)  are  independent.  Furthermore, 
we  Insure  sufficient  degrees  of  freedom  for  the  optimization  problem  (2.1)  by 
assuming  that  the  following  inequalities  are  satisfied  by  the  dimensions  of 
the  problem: 

0  <  p  <  r*s  <  n*s  .  (A. 3) 

*■  ♦ 

The  normal  equations  for  the  optimization  problem  can  be  derived  by  Lagrange fs 
multiplier  technique.  To  that  end  we  define  a  modified  objective  function 
W  by 


W 


2  1  T 
£  "2ci 

i=r 


T 

\  ci 


-  I  kJ  F,(X  +  c  ,  t) 
i=l 


(A.4) 


where  the  k^  are  r-dimensional  correlate  vectors  (Lagrange  multipliers).  Then 
the  normal  equations  are  obtained  by  setting  equal  to  zero  the  partial 


derivatives  of  W  with  respect  to  the  unknown  cit  t  and  The  result  is 

16,  7,  8,  9] 


c, 

i 


-  Ri 


T 

F  .k* 
xi  l 


0 


(A. 3a) 


I  Ft±  =0  ,  (A. 5b) 

i*l 


Ft 


(A. 5c) 
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Because  of  the  rank  condition  (A. 2)  and  because  R^  is  a  positive  definite 
matrix,  the  product  is  also  positive  definite  and  one  can  obtain  from 
Eq.  (A. 5a) 


ki  =  <Fxi  Vxi)~1  Fxi  ci  • 


Substituting  this  expression  back  into  Eq.  (A. 5a)  one  obtains  [6] 


ci  =  RiFxi(FxiRi  FxirlFxici  • 


(A.  6) 


We  shall  use  this  important  relation  later. 

In  the  n-dimensional  space  of  observables  we  define  the  inner  products 


(ci>  b^)  =  ciT  R^  b^ 


(A.7) 


and  the  elliptic  norms 


v1/2 


(A. 8) 


These  definitions  are  possible  because  the  R^  are  positive  definite.  We 
notice  that  the  norm  (A. 8)  is  dimensionless. 

The  model  equation  F^($,t)  -  0  defines  a  (n  -  r) -dimensional  hypersurface 
in  the  n-dimensional  space  of  observables.  Let  be  a  point  of  that 
surface.  The  hyperplane  orthogonal  to  the  surface  Fi  *  0  at  that  point  is  r- 
dimensional  and  is  spanned  by  the  rows  of  the  matrix 


=  Fxi  Ri  • 


(A.9) 


that  is,  for  an  arbitrary  r-vector  a^  the  corresponding  n-vector 


"i  -  “Fi  \  =  “l  Fxi*i 


(A. 10) 
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is  orthogonal  to  =  0,  whereby  the  orthogonality  is  defined  in  terms  of 
the  inner  product  (A.7).  To  show  this,  we  compute  the  inner  product  of  n^ 
with  a  vector  tangential  to  the  surface.  The  latter  vector  satisfies  the 
equation 


Fxi  bi  -  0  • 


(A. 11 ) 


The  inner  product  is 


(ni»  V  =  (Mn  V  bt  ) 


~T  „T  _-l  . 
ai  MFiRi  bi 


■  ?  Fxi  bi  =  0 


for  arbitrary 

Next,  we  normalize  the  matrix  by  defining 


(A. 12) 


(Fxi  %  Z)  ~U1  MFi 


<Fxi  Ri  Fxi  )_1/2  Fxi  R1 


(A. 13) 


The  r  rows  of  the  matrix  Nfi  again  span  the  orthogonal  hyperplane,  but  they 
are  unit  vectors  orthogonal  to  each  other.  We  show  this  by  computing  the 
inner  product  of  with  itself : 


NFi  %i 


-  (Fxi  Ri  4>‘1/2  Fxi  Ri  Fxi  (Fxi  RiFxirl/2 


(A. 14) 


That  Npi  spans  the  same  subspace  as  follows  from  the  formula 


-  T  T  -1/2  ^ 

N~  b.  =  R.F  .(F  ,R.F  ,)  b,  =  M_.  a^ 

FT  l  i  xi  xi  i  xi  i  Fi  i 


(A, 15) 


for  arbitrary  and  a  corresponding  proper  a^, 
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Next  we  show  that  the  least  squares  residual  is  an  element  of  the 
orthogonal  space.  According  to  Eq.  (A. 6)  the  residual  satisfies  the 
relation 


(A. 16) 


%  F*i  (Fxi  Ri  Fxi)_1/2  (Fxi  Ri  Fxi)_1/2  Fxi  ci 


NFi  NFiT  Ri  1  ci  ~  NFi 


with  a  proper  b^  and,  therefore,  is  an  element  of  the  subspace,  as  claimed. 
The  reduced  residual  a^  is  defined  by  Eq,  (2.2),  or 


ai  -  <F*i  Ri  4)'1/2  Fxi  ci  -  RI*  ci  • 


(A. 17) 


It  is  an  r-vector  with  dimensionless  components,  and  it  represents  the 
residual  c^  in  the  orthogonal  hyperplane.  According  to  Eq.  (A. 16)  the 
relation  (A. 17)  also  can  be  solved  for  the  ci: 


ci  ’  NFi  ai 


(A. 18) 


Hence,  either  of  the  two  vectors  a.^  and  c^  are  uniquely  determined  by  the 
other. 


The  Euclidean  norm  of  the  reduced  residual  is,  because  of  Eqs.  (A. 17) 
and  (A. 13), 


||  =  (aT  a . ) ly^2  =  {cT  [FT  (F  .  r!  F  ,)”lF  .  c.]} 
i1  1  i  i  1  i  1  xi  xi  i  xi  xi  iJJ 


T  „  v-1. 


(A. 19) 


/  T  -1  v  1/2  . ,  ,, 

=  ( c .  R.  c . )  =  c , 

iii  i 

Eq-  (A.  19)  shows  that  the  Euclidean  norm  of  the  r-vector  a.^  is  equal  to  the 
elliptic  norm  (A. 8)  of  the  n-vector  ci# 

A  relation  similar  to  (A. 17)  also  exists  between  the  reduced  residual 
a^  and  the  unknown  true  observational  error  c  of  the  observation  X^-  Let 
t  be  the  true  value  of  the  model  parameter.  Then  by  definition 


Fi(Xi  +  *±*  *>  =  0 


(A, 20) 
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Expanding  the  model  function  F^  at  the  least  squares  position  one  obtains 


F±(X 


* 

i  +  ci,  t) 


Fi(X.  +  Cl, 


t)  +  Fxi(*i  ~  ci)  +  Fti(*  _  t>+—  (A.21) 


The  left  hand  side  and  the  first  term  of^  the  right  £and  side  are  zero.  If 
one  neglects  terms  of  higher  order  in  c^  c  and  t  -  t,  then  Eq.  (A*  21) 
yields  the  following  relation  between  ci  and  c^ 


Fxi  ci 


"xi 


1  +  Fti 


(t  -  t) 


(A. 22) 


Eq*  (A. 22)  is  a  relation  between  projections  of  c^»  and  t  -  t  on  the 
orthogonal  subspace.  Because  c^  is  an  element  of  that  subspace  one  can  use 
Eq.  (A. 17)  and  calculate  the  contributions  of  the  other  two  projections  to 
the  reduced  residual  a^: 


ai  “  <Fxi  Ri  Fxi)_1'2  Fxi  *±  +  <Fxi  Ri  FxlT)-l/2  Fti(*  -  •  <A'23> 


Figure  1  illustrates  the  relation  (A. 23)  in  a  planar  curve  fitting  case. 

The  relation  (A. 17)  has  the  inversion  (A. 18)  that  allows  one  to 
calculate  c^  if  a^  is  given.  A  similar  ^inversion  of  Eq.  (A. 23)  would  permit 
one  to  compute  the  unknown  true  errors  c.  in  terms  of  the  a^  that  are  known 
from  the  least  squares  fitting.  However,  such  an  inversion  of  Eq.  (A. 23)  is 
generally  not  possible.  For  Eq.  (A.  17)  it  was  derived  by  using  Eq.  (A. 6) 
which  ig  satisfied  by  the  least  squares  residuals  c^,  but  not  by  the  true 
errors  c^.  This  result  is  geometrically  obvious  because  only  a  projection 
of  c_£  enters  Eq.  (A, 23)  and  one  cannot  reconstruct  a  vector  from  its 
projection.  An  exception  is  the  special  case  r  =  n,  that  is,  the  case  where 
the  constraint  equation  =  0  has  as  many  components  as  the  space  of 
observables.  In  that  case  the  Jacobian  matrix  Fxi  can  be  inverted  and  one 
obtains  from  Eq.  (A. 22),  for  instance,  the  expression 


* 

c. 

l 


=  c. 


F  1  F 
xi  ti 


(A.24) 
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as  an  estimate  of  the  true  errors.  A  corresponding  relation  between  c^  and 
a^,  which  in  this  case  also  has  n  components,  can  be  derived  by  substituting 
Eq.  (A.  18)  into  Eq.  (A.  24).  However,  in  this  special  case  the  reduced 

residuals  offer  the  only  advantage  that  they  have  dimensionless 
components.  Therefore,  an  investigation  of  heteroscedasticity  can  be  more 

effectively  done  in  this  case  in  terms  of  the  c^  instead  of  the  reduced 

residuals  a^,  because  of  the  simplicity  of  the  relation  (A. 24)  between  ^ 
and  c.^ » 

A  least  squares  model  fitting  can  determine  only  the  projection  of  the 
true  observational  errors  on  the  space  orthogonal  to  the  surface  F  =  0. 

Because  other  components  of  the  true  errors  are  lost  it  is  reasonable  to 
carry  out  any  investigations  of  the  residuals  only  in  the  orthogonal 
space.  This  is  done  by  using  the  reduced  residuals,  which  contain  all  the 
information  about  the  true  errors  that  is  present  in  the  least  squares 
results. 


APPLICATION  OF  THE  PRINCIPAL  COMPONENT  METHOD 
TO  TRAJECTORY  ESTIMATION 

William  S.  Agee  and  Robert  H.  Turner 
Mathematical  Services  Branch 
Data  Sciences  Division 
National  Range  Operations  Directorate 
US  Army  White  Sands  Missile  Range 
White  Sands  Missile  Range,  New  Mexico  88002 


1.  TRAJECTORY  ESTIMATION.  Measurements  of  range,  azimuth,  and  eleva¬ 
tion  from  several  different  radars  are  used  to  estimate  the  cartesian  posi¬ 
tion  coordinates  of  a  vehicle  trajectory  at  a  sequence  of  times,  t, ,  i=l,N 
.  which  cover  the  entire  trajectory.  Since  the  measurements  are  1 
subject  to  systematic  errors  as  well  as  random  measurement  errors,  we  also 
want  to  estimate  the  systematic  error  parameters  or  biases  in  addition 
to  the  trajectory  coordinates.  The  resulting  estimation  problem  is  a 
combined  linear  and  nonlinear  estimation  problem  in  which  the  trajectory 
coordinates  appear  as  nonlinear  parameters  in  the  measurements  and  the 
biases  appear  as  linear  parameters  in  the  measurements. 


Let  ha(x.)  be  a  measurement  function  where  is  the  cartesian  position 
vector  to  the  trajectory  at  time  t,. .  If  we  have  M  different  radars  observing 
the  trajectory,  then  a  -  1,  3M,  For  a  range  measurement  from  the  p^-  radar 

h«(*i) =  ^xrV2  +  (yrV2  +  (zrzp)2]  l'z  (1) 

where  (xp,  y  ,  zp)  are  the  cartesian  coordinates  of  the  origin  of  the  local 
cartesian  coordinate  system  at  the  p^-  radar.  For  an  azimuth  measurement 
from  the  p^  radar  the  measurement  function  is, 


h  (x.)  *  tan~*  — £ 

«  1  y^  -  y„ 


-L  L 

For  an  elevation  measurement  from  the  p^-  radar 

-1 


(2) 


ha(5f)  =  tan' 


Zi  ~2P 


[(x.-x  )2  +  (y.-y  )2]  l/Z 


(3) 


Let  ( t ^ )  denote  the  observed  value  of  the  o^-  measurement. 


The 


observations  are  modeled  as, 

W  "  W  +ba+ea(1) 


(4) 
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where  b  is  a  constant  measurement  bias  and  e  (i)  is  a  zero  mean,  random 

01  Ct  -r 

measurement  error.  Let  b  be  a  3M-dimensional  bias  vector  b  =  [bi  b2— b'3^]. 
Then  the  measurement  model  can  be  represented  as 

=  +  Sab  +  *o0)  (5) 

+■  h 

where  s  is  a  row  vector  with  a  one  in  the  a—  entry  and  zeros  in  all  other 

06 

entries. 


s 

a 


=  [0  0  — 0  1  0  — 0] 


position 


(6) 


Let  ]  be  known  variances  of  the  random  measurement  errors,  eo(  1  ). 
The  estimation  problem  to  be  considered  is  to  minimize, 


N  3M  __  2  i 

*  VZaCtiKC^Kb)  RaCti}  C7) 

1=1  a=l 

with  respect  to  x.,  i  s  1,  N  and  b.  Differentiating  (7)  with,  respect  to  x  = 
and  b  results  in  the  nonlinear  normal  equations 


=  H^(ii)R-a1cti)(za(t1.)-ha(;t)-sab)  =  0  1  -  l.N  Cs: 

a=  I 

5  ™  sIR>t^V-hAKb>  =  o  o: 

1-  l  a- I 

where  is  the  estimate  of  x^  and  b  is  the  estimate  of  b.  In  (8)  H^x.. ) 

3h  (x. ) 

*  _  iL  _ ?  . . «  i  *  06  l  T  ^  4-  r\  rtl  ha  +  U  «  n  AMm  -1  1  nnn  5  +■  1  nnr1  f  Ql/ 


is  the  derivative, 


.  In  order  to  solve  the  normal  equations,  they 


are  linearized  about  a  guess  trajectory,  x^s?  Let  x.j^,  i  =  1,  N  and  b^Sj 
satisfy  (8) ,  i =e. , 


s  H0Tlx1ts))R:1Ui)(za(t1)-ho{x1(s>)-Sab(s))  =  0  1-l.N  (10) 

a=l 


» 


(11) 


If  (8)  is  linearized  about 


(s) 


and 


,  we  obtain 


Cxrx.^')  = 


(sh  — a'V  fb-b^s)' 

Ai,N+lLb  b 


where 


and 


Ai  *  \  H>iCSV(VH>i(S)) 

of  l 

#!,k*i  ■  j1s.Viv».<Vs)> 


(12) 


(13) 


A-  is  3  X  3  and  Aj  .U1  is  3  X  3M.  Linearizing  the  second  normal  equation,  (9), 

1  f  \  I  I  A 

about  x.|  ;  and  solving  for  b  gives  the  result. 


t-  b(s>  = 


on  T  1  M  T  1  1 

(  i  h  JvSa V  (ti  1  Sa'i  I^iTn+1  Ai  Ai  ,  N+1 }  . 

[  l  1  S  TR*1(ti)(z  (t.)-h  (x.(s))-S  b(s))l 
|i’la“l  a  a  1  a  1  av  1  '  cl  1 1 


(14) 


(11)  and  (14)  for  -  x^  and  b  -  b^  are  the  basic  equations  for  trajectory 
estimation*  The  solution  to  the  normal  equations  are  obtained  by  successive 
relinearization  and  solution  of  (8)  and  (9). 

2.  APPLICATION  OF  PRINCIPAL  COMPONENTS  REGRESSION.  Although  there  are  no 
convergence  problems  in  solving  the  normal  equations  iteratively  for  the 
M-station  radar  case,  another  problem  which  is  fairly  common  in  the  solution  of 
linear  least  squares  problems  also  occurs  freguently  in  trajectory  estimation. 
Very  often,  the  estimate  of  the  bias  vector,  b,  converges  to  a  solution  for  which 
several  of  the  components  are  too  large  and  may  have  the  wrong  sign.  Sometimes 
the  bias  estimate  is  obviously  erroneous.  One  obviously  erroneous  case  which 
arises  frequently  is  that  the  elevation  bias  components  will  all  be  large  and  of 
the  same  sign.  This  problem  of  the  estimated  bias  vector  being  too  long  is 
usually  attributed  to  mul ti col  1  inear ity  among  the  predictor  variables  in  the 
linear  least  squares  problem.  The  problem  in  the  linear  estimation  case  is  often 
successfully  treated  by  some  method  of  biased  estimation.  The  problem  has  not 
been  properly  recognized  or  successfully  treated  when  it  arises  in  trajectory 
estimation.  Although  the  existence  of  these  erroneous  bias  estimates  has  been 
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recognized  in  trajectory  estimation,  the  source  of  the  difficulty  was  not 
properly  recognized.  Some  workers  in  trajectory  estimation  have  stated  that 
the  existence  of  this  problem  demonstrates  the  need  to  specify  a  prior 
distribution  for  the  biases  in  order  to  "tie  down"  or  statistically  constrain 
the  bias  estimates.  It  does  not  take  much  experience  in  using  these  priors 
for  trajectory  estimation  to  realize  that  the  problem  of  inflated  bias  estimates 
is  as  much  present  with  the  prior  as  without  the  prior.  We  have  attempted  to 
treat  this  problem  both  with  ridge  regression  and  with  principal  components 
regression.  We  have  had  some  success  with  both  methods  but  neither  method 
has  entirely  solved  the  problem.  Of  the  two  methods  we  have  had  the  most 
success  with  principal  components,  probably  because  it  is  easier  to  apply. 

The  principal  components  method  also  illuminates  a  problem  that  arises  in  the 
application  of  either  method. 

The  linearized  equation  (14)  to  be  solved  for  the  bias  estimate  can  be 
written  as 


Qb  =  U 


where 


Q  = 


N  3M  T  I  N  T  , 

y  ys1  r"  (t . )s  -  y  a!  w.nA, 'a. 

i=Jjct=*lQt  ala  1  jN+1  l  1  ,N+1 


and 


u.  Vi  STR  Ct.KZ  Ct.)  -h  (4s h  -  S  b(s))  +  Qb^s ^ 

iila^l  a  a  1  a  V  a  1  a 


(15) 


(16) 


Partition  the  bias  vector  into  angle  components  b  and  range  components  b 

a  r 

so  that  the  linear  bias  estimation  equation  is 


07) 


Scale  the  bias  estimates  according  to 


(18) 
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Where  Da  and  are  diagonal  matrices  chosen  so  that  D”^Q  D_1=  Q1  and 
_  -j  _  i  a  a  a  a 

Dr  QrDr  =  each  have  diagonals  of  unity.  The  transformed  bias  estimation 
equation  is 


with  R‘  -  D^RDjJ  and  15  ,11^=0^11^..  Suppose  the  bias  vector  is 


further  transformed  by 


where  T  is  orthogonal  and  diagonalizes 


In  principal  components  regression  the  components  of  3  which  correspond 
to  very  small  eigenvalues  of  Q'  are  set  to  zero.  Specifically,  let  r  be  the 
largest  integer  for  which 


3M 


i=3M-r 

3M 


(22) 


where  Yi  i=l ,  3M  are  the  eigenvalues  of  Q‘  and  we  have  ordered  the  y\  so  that 

Define  B*  =  0  for  r<i<3M  and  B*  31  Bi  otherwise.  The  principal 

components  solution  of  the  original  system  of  equations,  denoted  by  [b*  b*L 

a  r 

is  defined  by 
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Consider  the  following  example  from  WSMR  data.  We  have  three  radars, 
R122,  R123,  and  R395  tracking  a  level  flying  target,  flying  at  an  altitude 
of  about  30,000  ft.  The  graph  of  Fig  1  shows  the  relative  geometry  of  the 
target  trajectory  and  radars.  The  least  square  estimates  of  the  radar 
biases  obtained  for  this  example  are 


R122 

R123 

R395 

Range  bias  (ft) 

TTO 

TTO 

63.7 

Azimuth  Bias  (mi li radians) 

.116 

.058 

.148 

Elevation  bias  (mi li radians 

-.737 

-.947 

-.538 

The  values  of  the  elevation  bias  estimates,  which  are  all  large  and 
negative  illustrate  a  commonly  occuring  type  of  erroneous  solution  in 
radar  trajectory  estimation.  In  this  example  we  are  able  to  confirm  that 
the  radar  bias  estimates  above  are  greatly  in  error.  Using  measurements 
from  tracking  cameras  we  are  able1  to  obtain  trajectory  estimates  which  are 
much  more  accurate  than  trajectory  estimates  obtained  from  radars.  By 
comparing  the  optically  derived  trajectory  against  the  radar  measurements  we 
obtain  the  following  radar  measurement  bias  estimates,  which  we  use  as  a 


standard. 

R122 

R1 23 

R395 

Range  bias  (ft) 

157.3 

152.9 

80.3 

Azimuth  bias  (mi  1 i radians ) 

.05 

.02 

.09 

Elevation  bias  (miliradians) 

.11 

-.08 

-.09 

The  large  errors  in  the  radar  bias  estimates  especially  in  elevation  are  readily 
apparent. 
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FIGURE 
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The  eigenvalues  for  the  Q1  matrix  and  the  corresponding  components  of 

A 

the  solution  vector  3  are: 


^i 


*i 


1.9306182 
1.7776564 
1.7112154 
1.28B728 
1.2173482 
1.0753069 
. 45750007x1 Q"2 
. 1 601 071 9x1 0~z 
, 30452861 xl 0— 3 


-61.638411 

27.106658 

17.060269 

-25.130580 

25.571920 

88.890445 

-481.67132 

-83.385294 

55.338603 


Using  the  criterion  stated  in  (22)  principal  components  regression  will  set 
the  last  three  components  of  3*  to  zero.  The  principal  component  solution  for 
the  radar  bias  estimates  is 


R1 22 

R123 

R395 

Range  bias  (ft) 

11.9 

8.2 

-22.3 

Azimuth  bias  (mil i radians) 

.09 

.03 

.47 

Elevation  bias  (mil i radians) 

.13 

-.08 

-.13 

The  principal  components  method  for  this  example  gives  large  errors  in  the 
estimates  of  the  range  biases  and  a  large  error  in  the  azimuth  bias  estimate 
for  R395,  Thus,  it  appears  that  the  principal  components  method  does  not 
yield  useful  results  when  applied  to  this  example.  However,  we  have  found 
that  the  principal  components  method  does  give  good  results  when  applied  to 
this  example  in  a  slightly  different  way. 

Consider  the  partitioned  bias  estimation  equation  in  (19).  Transform  the 

A  A  A  A 

angle  bias  vector  b^  as  b*  =  Tb,  where  T  is  orthogonal.  The  range  bias,  b', 

a  a  r 

can  be  eliminated  from  (19)  by  substitution.  We  obtain 

=  q;_1(u;  -  r,tts) 
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and 


TT(Qg  '  R,TQ^-1  R ' )TS  =  TTCU^  -  R'TQ|L'1Up 

(24) 

Choose  T  so  that 

tt(q:  -  R,Tq:_1R')T  -  r 

a  r 

(25) 

Then  r£  =  .tt(u;  -  r,tq;."V) 

(26) 

The  values  of  the  radar  bias  estimates  for  the  current  example  obtained  from 
applying  the  principal  component  method  to  this  reduced  problem  are: 


R1 22 

ft!  23 

R395 

Range  bias  (ft) 

147,7 

144,1 

70.2 

Azimuth  bias  (mill radians) 

,01 

-.04 

.11 

Elevation  bias  (mi li radians  1 

,13 

-.08 

-.06 

The  biases  computed  by  applying  the  principal  component  method  to  the  reduced 
bias  estimation  problem  are  statistically  compatible  with  the  radar  biases 
derived  by  comparison  with  optical  tracking  data.  Another  example  of  inflated 
radar  bias  estimates  comes  from  a  recent  missile  vs.  drone  engagement  at  WSMR. 
This  example  will  indicate  some  additional  difficulties  in  applying  principal 
components  to  the  radar  bias  estimate  problem.  In  this  example  we  have  three 
radars,  R124,  R125,  and  R442  tracking  the  missile.  The  least  squares 


estimation  of  trajectory  and 

radar  biases  obtained  the 

following 

bias  estimates. 

R124 

R125 

R442 

Range  bias  (ft) 

253 

307 

164 

Azimuth  bias  (mil: i radians) 

.21 

.21 

-.57 

Elevation  bias  (mi li radians) 

-.39 

-.71 

-.85 
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For  this  example  the  eigenvalues  of  the  reduced  matrix,  Q'  -  R,TQ' ***R' ,  and 

a  r 

a 

the  corresponding  components  of  the  solution  vector  6  are: 


1.8357080 

1.4980308 

1.0704788 

1.2442094 

.25488164 

.096690436 


38.432177 

-42.134995 

7.2953978 

26.848291 

-171.70691 

-442.92985 


If  we  apply  the  criterion  stated  in  (22)  for  zeroing  components  of  6 
corresponding  to  small  eigenvalues,  we  find  that  no  components  should  be 
zeroed  since  even  the  two  smallest  eigenvalues  are  not  very  small.  Nevertheless, 
we  zero  the  components  of  8  corresponding  to  the  two  smallest  eigenvalues. 

This  results  in  the  bias  estimates, 


R124 

R1 25 

R442 

Range  bias  (ft) 

263 

308 

193 

Azimuth  bias  (mi li radians) 

-.07 

-.05 

-.12 

Elevation  bias  (mi liradi ans) 

.05 

-.08 

0 

From  optical  tracking  data  on 

this  drone  we 

computed 

the  folli 

of  the  radar  biases. 

R124 

R125 

R442 

Range  bias  (ft) 

254 

297 

204 

Azimuth  bias  (mi li radians) 

0 

-.04 

-.16 

Elevation  bias  (mi liradi ans) 

0 

-.08 

-.09 
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The  bias  estimates  obtained  by  applying  principal  components  are  much  more 
compatible  with  the  optically  derived  bias  estimates  than  the  bias  estimates 
obtained  from  least  squares.  We  have  tried  several  other  examples  from 
which  we  obtain  similar  results  and  conclusions. 

We  believe  that  we  have  made  significant  progress  in  solving  the  problem  of 
inflated  measurement  bias  estimates  in  trajectory  estimation.  However,  before 
we  could  be  confident  in  the  routine  application  of  the  principal  component 
method  described  above  to  M-station  radar  trajectory  estimation,  several 
questions  should  be  carefully  considered, 

(1)  Why  must  the  principal  component  method  be  applied  to  the  reduced 
bias  estimation  problem  (angles  only)  rather  than  the  full  bias  estimation 
problem? 

(2)  What  criterion  should  be  used  zeroing  components  of  $  corresponding 
to  small  eigenvalues? 

(3)  Should  the  principal  component  method  somehow  be  applied  to 
obtaining  improved  range  bias  estimates? 

(4)  Are  there  other  biased  estimation  methods  such  as  ridge  regression, 
fractional  rank  regression,  etc.,  which  would  yield  better  results  than 

the  principal  component  method  when  applied  to  the  estimation  of  measurement 
biases  in  M-station  radar  trajectory  estimation? 


441 


OPTIMAL  REDUCED  ORDER  CONTROLLERS 
for 

DISCRETE-TIME  LINEAR  SYSTEMS 
with 

PARTIAL  STATE  ESTIMATION 


Maurice  F.  Hutton 

IBM  Corporation 
Federal  Systems  Division 
Owego,  New  York  13827 


ABSTRACT 

The  solution  to  the  standard  steady  state  LOG  problem  is  a 
controller  whose  order  equals  the  order  of  the  plant  model. 
For  many  high  order  plant  models,  a  reduced  order  controller 
which  is  easier  to  mechanize  can  be  used  with  good 
per-formance.  A  methodology  is  described  for  solving  the 
discrete— time,  steady  state  LQG  problem  with  the  constraints 
that  the  controller  be  of  reduced  order  and  that  linear 
combinations  of  the  controller  states  estimate  particular 
plant  model  state  variables.  Formulas  for  simplifying  the 
gradient  calculations  are  introduced. 


NOMENCLATURE 

Dimensions 

Symbol  Definition 

k  =  number  of  plant  states 
1  =  number  of  plant  inputs 
m  =  number  of  plant  outputs 
p  ~  number  of  plant  noise  inputs 
r  =  number  of  controller  states 
s  =  number  of  variables  being  tracked 

Matrices 

Size  Symbol  Definition 

k  x  k  A  =  plant  system  coefficients 

k  x  1  B  =  plant  input  coefficients 

m  x  k  C  =  plant  output  coefficients 

k  x  p  D  =  plant  input  noise  coefficients 

s  x  s  E  =  performance  weights  on  tracking  error 

r  x  r  F  =  controller  system  coefficients 

r  x  m  G  =  controller  input  coefficients 

1  x  r  H  =  controller  output  coefficients 
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s 

X 

k 

K  = 

tracking  coefficients 

s 

X 

r 

L  = 

approximate  tracking  coefficients 

k+r 

X 

k+r 

M  - 

closed  loop  system  coefficients 

k+r 

X 

p+m 

N  = 

closed  loop  input  noise  coefficients 

k+r 

X 

k+r 

P  = 

variance  of  closed  loop  state  vector 

k 

X 

k 

Q  - 

performance  weights  on  plant  state 

1 

X 

1 

R  = 

performance  weights  on  plant  input 

p+m 

X 

p+m 

S  = 

variance  of  closed  loop  input  noise  vector 

s 

X 

k+r 

T  = 

closed  loop  tracking  coefficients 

k+r 

X 

k+r 

U  - 

composite  performance  weights 

P 

X 

P 

V  = 

variance  of  plant  input  noise  vector 

m 

X 

m 

w  = 

variance  of  plant  measurement  noise  vector 

k+r 

X 

k+r 

Y  = 

Lagrange  multipliers 

k+r 

X 

k+r 

z  = 

performance  weights  on  closed  loop  state 

Vectors 

Size 

Symbol  Definition 

s 

e  = 

tracking  error 

k+r 

f  = 

closed  loop  state 

p+m 

g  = 

closed  loop  input  noise 

r 

q  - 

controller  state 

1 

u  - 

plant  input 

P 

v  = 

plant  input  noise 

m 

w  = 

plant  measurement  noise 

k 

X  = 

plant  state 

m 

y  = 

plant  output 

s 

Z  “ 

variables  being  tracked 

5 

h  « 

estimates  of  the  variables  being  tracked 

Scalars 

Symbol  Definition 

b  =  step  size  in  steepest  descent  method 
J  =  performance  index 

J  ~  modified  performance  index 
n  =  discrete-time  index 


I NTRQDUCTDN 

The  motivation  for  this  study  was  an  idea  for  improving  the 
design  used  to  fly  target  aircraft  in  formation  under 
computer  control-  This  Drone  Formation  Control  System 
(DFCS) ,  currently  in  operation  at  the  White  Sands  Missle 
Range  in  New  Mexico,  places  a  premium  on  the  complexity  of 
the  control  design  because  the  computations  must  be 
performed  in  real  time.  However,  the  plant  models  for  a 
single  drone  which  include  airframe,  actuator,  and  autopilot 
dynamics  have  had  as  many  as  32  states.  Applying  the 
standard  steady  state  LDG  method  in  such  a  case  would  result 
in  a  32  state  feedback  controller. 
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An  attractive  alternative  is  to  design  a  reduced  order 
controller  that  minimizes  a  quadratic  performance  index.  The 
minimum  value  of  the  performance  index  -for  the  reduced  order 
design  will  be  larger  than  the  value  achieved  by  the  high 
order  design.  Nevertheless,  a  reduced  order  drone  controller 
should  meet  the  performance  requirements  since  only  a  -few  o-f 
the  32  modes  are  dominant.  The  decrease  in  complexity  is 
expected  to  be  worth  the  small  difference  in  performance. 
This  property  applies  to  many  other  applications  involving 
high  order  linear  models. 

In  the  reduced  order  approach  the  designer  specifies  the 
order  and  structure  of  the  controller.  Hence  the  design 
problem  is  to  find  the  controller  gains  that  minimize  the 
performance  index.  The  theory  for  solving  this  problem  in 
the  continuous- time  case  is  described  in  Cll— C43. 

A  disadvantage  of  using  the  reduced  order  approach  is  that 
the  controller  states  lose  their  physical  meaning.  For  the 
standard  LQG  method,  the  controller  states  (for  a  particular 
realization)  are  estimates  of  the  plant  states.  This  is  not 
true  for  the  reduced  order  controller.  Because  DFCS  must  not 
only  control  but  also  track  the  drones,  this  limitation  of 
the  reduced  order  approach  presents  a  problem.  Fortunately, 
estimates  of  all  32  drone  states  are  not  needed.  For 
tracking  purposes,  the  DFCS  controller  only  needs  to 
estimate  position  and  velocity. 

To  resolve  the  tracking  problem,  the  reduced  order  approach 
is  extended  to  provide  estimates  of  some  state  variables  of 
special  interest  to  the  designer.  To  be  more  exact,  linear 
combinations  of  the  state  variables  may  be  estimated.  The 
extension  is  accompished  by  modifying  the  performance  index 
and  the  structure  of  the  controller. 

The  basic  equations  for  designing  the  reduced  order 
controller  with  partial  state  estimation  are  described 
below.  The  definitions  of  the  symbols  used  in  the  equations 
are  given  in  the  nomenclature  section. 


PROBLEM  FORMULATION 


The  linear,  time- 

-invariant 

model  of  the 

plant  to  be 

controlled  is 

x  (n+1) 

=  Ax  (n) 

+  Bu(n)  + 

Dv(n) 

y  (n) 

-  Cx (n) 

+  w(n) 

In  this  state  space  representati on  the  vectors  x,  u,  y 
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denote  the  state,  input,  and  output,  respectively.  The 
random  vectors  v  and  w  modeling  the  plant  disturbances  and 
measurement  errors  are  zero  mean  white  noise  sequences  with 
variance  matrices  V  and  bJ,  respecti vely- 


EC  vv’>  =  V 
EC  ww’>  -  W 


(2) 


The  operator  EC  >  is  the  steady  state  expectation. 

The  -feedback  controller  is  assumed  to  have  the  -Following 
structure 


q (n+1 ) 

u  <n> 

=  Fq(n) 

-  -Hq (n) 

+  Sy(n> 

(3) 

The  order  of  the 
and  is  less  than 

controller 
or  equal  to 

r  is  selected  by  the  designer 
the  order  k  of  the  plant  model. 

L 

VI 

VI 

(4) 

In  addition  to  computing  the 
controller  must  also  compute 
model  variables  z  which  are 
states.  The  estimates  h  are 
controller  states. 

feedback  control,  the 
estimates  h  of  particular 
linear  combinations  of  the 
linear  combinations  of  the 

plant 

plant 

z  (n) 

=  Kx  (n) 

(5) 

h(n) 

-  Lq(n> 

(6) 

The  tracking  error  e  is  the  difference  between  the  estimated 
and  actual  values  for  the  particular  plant  model  variables 
to  be  tracked, 

e(n)  =  h (n)  -  z(n)  (7) 


The  performance  index  is  the  steady  state  expectation  of  a 
quadratic  form  weighting  the  state,  input,  and  tracking 
error, 

J  =  EC  x’Qx  +  u’Ru  +  e’Ee  >  (8) 


The  first  2  terms  in  (8>  correspond  to  the  standard 
definition  of  the  performance  index  for  the  steady  state  LQG 
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di screte- time  problem*  The  additional  term  provides  a  means 
o*f  extending  the  controller  design  to  perform  partial  state 
estimation. 

To  summarize,  the  objective  is  to  find  the  matrices  F,  G,  H, 
L  that  minimize  the  performace  index  (B)  subject  to 
conditions  <i)~<7).  A  block  diagram  of  the  closed  loop 
structure  is  shown  in  Figure  1.  It  is  assumed  that  the 
design  parameters  or  controller  gains  in  the  minimization 
are  particular  elements  of  the  matrices  F,  G,  H,  L.  This 
assumption  is  more  general  than  necessary  but  adequate  for 
many  applications,  including  the  applications  of  current 
interest  to  the  author-  The  designer  specifies  which  matrix 
elements  are  to  be  design  parameters  as  part  of  the  problem 
setup.  In  order  to  have  a  unique  solution,  not  all  the 
elements  of  the  controller  coefficient  matrices  can  be 
considered  as  design  parameters.  Denery  C51  has  shown  that 
only  r(l+m)  unknown  parameters  are  needed  in  general  to 
determine  the  matrices  F,  G,  H  defining  the  feedback 
control . 

It  is  also  assumed  that  a  solution  exists.  The  conditions 
guaranteeing  that  the  optimization  procedure  will  converge 
to  a  unique  solution  remain  to  be  studied. 


THEORY 


The  -first  phase  o-f  the  derivation  is  to  express  the  closed 
loop  dynamics  in  state  space  -form. 

Let 


Q  m  \  — 


BC 


N  =  - 

0 


Then  the  closed  loop  dynamics  are  described  by 


f(n+l)  -  Mf(n)  +  Ng(n) 


Also  let 


=  [-K  :  LJ 


Then  the  tracking  error  in  terms  of  the  closed  loop  dynamics 


e(n)  -  Tf(n) 


The  second  phase  is  to  find  an  equation  for  the  variance  P 
of  the  closed  loop  state  vector  f  as  a  function  of  the 
variance  S  of  the  closed  loop  input  noise  vector  g  . 

By  definition 

El  ff’>  =  P 

iU 

EC  gg*  >  =  S 


which  from  <2)  and  (9)  imply  that 


S  = 


Since  (11)  is  a  linear,  time-invariant  di screte-time  system 
excited  by  a  white  noise  seqence 

p  =  MPI-T  +  NSN’  (16) 
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The  third  phase  is  to  express  the  performance  index  in  a 
form  more  suitable  for  differentiation.  Rewriting  (S)  gives 

J  =  EC  trC  Gxx’  +  Ruu’  +  Eee?  3  >  (17) 


The  operator  trC  3  denotes  the  trace  of  a  square  matrix 
which  equals  the  sum  of  the  diagonal  elements.  The 
performance  index  is  next  written  in  terms  of  the  closed 
loop  state  vector.  In  order  to  do  this,  define 


Z  * 


(18) 


Substituting  (3)  in  to  the  2nd  term  and  (7)  into  the  3rd 
term  of  (17)  gives  after  applying  (5),  (6),  (9),  (12),  and 

(13) 

J  =  EC  trC  Zff’  +  ETf f ?T*  3  >  (19) 


Using  the  commutive  property  of  the  trace  operator,  (19) 
simplifies  to 

J  =  EC  trC  Uff'  3  >  (20) 


where 


U  =  Z  +  T’  ET  (21) 

Transposing  the  trace  and  steady  state  expectation  operators 
plus  substituting  (14)  results  in 

J  =  trC  UP  3  =  trC  PU  3  (22) 

The  problem  can  now  be  stated  as  finding  the  matrices  F,  G, 
H,  L  that  minimize  (22)  subject  to  the  equality  constraints 
imposed  by  (16). 

This  problem  is  equivalent  to  minimizing  the  following 
function  in  which  the  independent  variables  are  the  unknown 
elements  in  the  matrices  F,  G,  H,  L,  P,  Ys 

(23) 

J  =  trC  UP  +  (  MPM’  +  NSN’  -  P  )  Y  3 

The  matrix  Y  contains  the  Lagrange  multipliers.  The  use  of 
the  modified  performance  index  (23)  instead  of  (22)  and  (16) 
removes  the  constraint  equations.  The  minimal  solution  of 
(23),  which  is  also  the  minimal  solution  of  (22),  is 
computed  numerically  as  described  in  the  next  section. 
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COMPUTATIONS 


The  optimum  solution  of  (23)  can  be  found  by  setting  to  zero 
the  partial  derivatives  of  the  modified  performance  index 
with  respect  to  the  unknown  matrix  elements-  Differentiating 
(23)  with  respect  to  Y  yields  (16).  Differentiating  (23) 
with  respect  to  P  gives  the  following  adjoint  equation: 

Y  =  M'YM  +  U  (24) 

If  the  symmetric  matrices  P  and  Y  are  partitioned  according 
to 


P  = 


then  the  modified  performance  index  in  (23)  can  be  rewritten 
as  an  explicit  function  of  the  matrices  F,  G,  H ,  and  L. 
Differentiating  the  resulting  expression,  with  the  aid  of 
the  formulas  in  Appendix  A,  yields 


1 

bJ 

(26) 

- 

— 

li 

■< 

V 

* 

1 

BHP  )  + 

Y  (  GCP  + 

FP  ) 

2 

3  3 

2 

2  3 

2 

1 

dJ 

- 

— 

=  Y  GW  + 

(27) 

2 

bG’ 

2 

C  Y7 (  AP  - 

BHF7 )  + 

Y  (  GCP  + 

FP7  )  1 

C7 

3  1 

3 

2  1 

3 

1 

bJ 

- 

— 

=  RHP 

(28) 

2 

dH7 

2 

B7  C  Y7  (  AP 

-  BHP  ) 

+  Y  (  GCP 

+  FP  ) 

1 

1  3 

2 

3  3 

2 

1 

bJ 

- 

— 

-  E  (  LP  - 

KP  ) 

(29) 

2 

dL7 

2 

3 
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Recall  that  not  all  the  elements  of  the  matrices  F,  G,  H,  L 
are  neccessarily  design  parameters.  Hence  setting  (26) -(29) 
to  zero  will  most  likely  result  in  more  equations  than 
unknowns-  This  difficulty  is  resolved  by  applying  the 
following  property:  Let  Fij  be  a  design  parameter  of  the  F 
matrix  located  in  row  i  and  column  j.  The  derivative  of  the 
performance  index  with  respect  to  this  design  parameter  is 
simply  the  element  of  the  matrix  computed  by  (26)  that  is 
located  in  row  i  and  column  j-  In  other  words 


dJ 


dJ 


dFi  j 


dFw 


i  J 


(30) 


Equations  similar  to  (30)  apply  to  (27)-<29). 


The  approach  described  by  (26)— (30)  for  computing  the 
partial  derivatives  eliminates  the  need  of  differentiating 
the  modified  performance  index  seperately  for  each  design 
parameter-  Hence,  this  approach  is  especially  useful  for 
computer  programming  purposes. 

The  equations  obtained  by  setting  (26) -(29)  to  zero  can  not 
be  solved  analytically  for  the  design  parameters  in  the 
matrices  F,  G,  H,  L.  Based  on  the  gradient  of  the  modified 
performance  index,  the  design  parameters  are,  instead, 
adjusted  iteratively  to  converge  toward  the  minimum 
solution.  This  technique  is  the  steepest  descent  method  and 
the  key  steps  in  the  numerical  solution  are  briefly 
described  below. 

1.  Make  an  intial  guess  of  F,  G,  H,  L- 

2.  Solve  (16)  for  P. 

3.  Solve  (24)  for  Y. 


4.  Use  (25) -(29)  to  compute  the  partial  derivatives  or 
gradient  matrices.  Those  elements  of  the  4  gradient  matrices 
that  are  not  the  design  parameters  should  be  set  to  zero. 


5.  Compute  a  new  set  of  design  parameters  according  to 


F  =  F 
G  =  G 
H  =  H 
L  =  L 


b(dJ/dF’ ) 
b<dJ/dG’ > 
b  (dJ/c)H* ) 
b(dJ/dL* ) 


b>0 


(31) 
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The  scalar  b  determines  the  magnitude  of  the  adjustment  in 
the  solution.  The  value  of  b  is  selected  large  enough  to 
provide  rapid  convergence  and  yet  small  enough  to  insure  an 
accurate  solution. 

6.  Repeat  steps  2-6  using  the  new  values  of  the  matrices  F, 
G,  H,  L  until  convergence  occurs. 


CONCLUSIONS 


An  attractive  method  for  computer-aided  control  system 
design  is  the  steady  state  LOG  theory  modified  to  permit 
reduced  order  controllers.  To  apply  this  approach  the 
designer  has  the  additional  task  of  specifying  the  order  and 
configuration  of  the  controller*  The  advantage  is  that  the 
resulting  controller  design  is  simpler  and  hence  better  able 
to  meet  the  demands  of  real-time  control. 

One  of  the  limitations  of  using  the  reduced  order 
controller,  which  is  addressed  by  the  this  paper,  is  that 
the  controller  states  can  no  longer  be  interpreted  as 
estimates  of  the  plant  states.  It  is  shown  that  the  design 
method  can  be  extended  so  that  the  controller  will  also 
estimate  key  variables.  This  extension  does  not  alter  the 
basic  mathematical  structure  of  the  equations  to  be  solved 
for  the  controller  design. 

The  extension  permits  the  designer  to  tradeoff  estimation 
accuracy  versus  control  accuracy  by  varying  the  weighting 
matrices  in  the  performace  index.  The  sensitivity  of  the 
closed  loop  performance  to  increases  in  the  weighting  on  the 
estimation  accuracy  remains  to  be  investigated. 

Special  formulas  are  presented  for  computing  the  derivative 
of  the  trace  operator  with  respect  to  a  matrix.  These 
formulas  are  useful  in  developing  a  general  computer  program 
to  implement  the  design  method. 

The  di screte— ti me  control  problem  is  considered  in  this 
paper  but  the  method  readily  extends  to  the  continuous— time 
case. 

The  future  use  of  the  method  for  real-time  drone  control  and 
other  applications  appears  promising. 
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Appendix  A:  MATRIX  GRADIENT  FORMULAS  FOR  TRftCE  OPERATOR 


Formulas  for  computing  derivatives  of  scalar  functions  with 
respect  to  a  matrix  are  listed  below.  The  class  of  scalar 
functions  is  the  trace  of  a  matrix  product.  These  formulas 
are  useful  in  the  computation  of  the  gradient  used  by  the 
steepest  descent  method  in  finding  the  solution  that 
minimizes  the  performance  index. 


H  =  dt  rt  F  (X)  G 


F  =  A  H 
F  =  X  H 
F  -  AX  H 
F  “  AX'  H 
F  =*  XB  H 
F  =  X*  B  H 
F  *  AXB  H 
F  =  AX7  B  H 
F  =  X'AX  H 
F  =  XAX 7  H 
F  =  B7  X’ AXB  H 
F  =  BXAX 7  B7  H 


1  /  dX7 

0 
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A’G 

G7  A 

GB7 

BG7 

A7  GB7 

BG7  A 

AXG7  + 

A7  XG 

GXA7  + 

G7  XA 

AXBG7  B7 

+  A7  XBGB7 
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+  B7  G7  BXA 
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ABSTRACT .  As  part  of  a  program  to  select  an  optimal  tactical  weapon  sys¬ 
tem  for  specified  environmental  conditions  a  procedure  is  presented  for  pre¬ 
dicting  the  apparent  infrared  image  of  a  target  in  a  background  that  would  be 
seen  by  an  infrared  sensor  in  a  missile  or  airplane.  Environmental  effects 
are  introduced  through  the  exchange  of  energy  between  the  target,  atmosphere, 
and  background  by  radiation,  conduction,  convection,  and  latent  heat.  The 
diurnal  variation  of  the  temperature  and  radiance  of  target  and  background 
surfaces  are  calculated  including  the  effect  of  target  orientation  relative  to 
the  sun.  Numerical  calculations  are  presented  for  temperatures  and  radiances 
of  the  facets  of  a  petroleum  storage  tank. 

INTRODUCTION.  In  any  future  conflict,  technology  may  provide  us  with  our  most 
effective  advantage  over  a  numerically  superior  foe.  Technology  in  this  sense 
infers  not  only  more  capable  and  sophisticated  systems,  but  also  the  ability 
to  use  them  effectively.  Emphasis  on  high  technology  has  generated  numerous, 
remarkably  capable  systems  exemplified  by  the  list  of  Army  electro-optical  de¬ 
pendent  systems  given  in  Table  la.  These  devices  and  more  advanced  systems 
under  development  will  provide  significantly  enhanced  surveillance,  target  ac¬ 
quisition,  and  terminal  homing  capabilities.  Doctrine  and  tactics  to.  exploit 
the  advantages  these  systems  provide  must  account  for  their  increased  sensi¬ 
tivity  to  the  operational  environment,  the  price  tag  for  extended  performance 
and  increased  autonomy. 

The  European  environment  can  be  especially  hostile  to  electro-optical 
systems.  Table  lb  illustrates  that  visual  systems  are  nearly  useless  in  the 
winter  to  detect  and  lock  on  to  a  tank  target  at  a  range  of  2750  m  (1).  Infra 
red  (IR)  systems  fare  better;  however,  they  are  still  ineffective  for  a  signif 
icant  fraction  of  the  time,  especially  in  the  winter.  A  synopsis  of  West  Ger¬ 
man  weather  shows  that  in  the  winter  overcast  cloud  conditions  occur  70  to 
80  percent  of  the  time  (Table  lc)  (1).  Cloud  cover  can  severely  impact  visi¬ 
ble  and  IR  system  performance  by  obscuration  (reduction  of  atmospheric  trans¬ 
mission)  and  for  IR  systems  a  suppression  of  target-background  contrast. 

Obviously,  a  single  high  technology  system  is  not  always  the  optimum 
choice  for  a  particular  mission.  It  is  imperative  to  know  when  they  will  work 
and  when  they  will  not,  which  systems  will  be  most  effective  in  an  anticipated 
set  of  conditions,  and  what  performance  level  can  be  expected  for  the  system 
selected.  This  requires  the  implementation  of  the  second  component  of  tech¬ 
nology  mentioned  in  the  first  paragraph,  the  ability  to  project  system  per¬ 
formance  and  optimize  the  effectiveness  of  systems  by  choosing  the  most  appro¬ 
priate  alternative  for  the  anticipated  operational  environment.  This  is  the 
essence  of  the  Tactical  Decision  Aid  (TDA)  concept  (Figure  la). 
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In  the  case  of  advanced  electro-optical  controlled  weapon  systems  the 
concept  embraces  the  development  of  analytical  methods  to  forecast  system  per¬ 
formance  for  the  conditions  anticipated  at  the  time  of  target  engagement. 

This  includes  the  type  of  target,  the  terrain  conditions  (background)  sur¬ 
rounding  the  target,  weather  conditions,  and  atmospheric  transmission.  It  is 
in  essence  an  analysis  of  sensor-environment  interactions  coupling  forecasts 
of  future  weather  conditions,  atmospheric  transmission,  and  the  dynamic  re¬ 
sponse  characteristics  of  targets  and  backgrounds  using  mathematical  algo¬ 
rithms  that  mimic  the  most  significant  interactions.  The  TDA  is  a  projection 
of  performance  for  alternative  systems,  formulated  to  clearly  illustrate  which 
system  to  use,  when  to  use  it,  and  how  well  it  will  work.  As  such,  the  TDA  is 
a  near-real  time  mini-war  game  that  strives  to  optimize  effectiveness  of  our 
high  technology  systems . 

The  prediction  of  the  performance  of  a  sophisticated  system  must,  realis¬ 
tically,  account  for  some  rather  complex  phenomena.  For  this  reason,  the 
basic  models  should  be  based  on  physical  principles,  and  should  predict  the 
strength  of  the  intrinsic  signal  of  the  target  and  its  attenuation  during 
transmission  through  the  environment  to  the  weapon  sensor.  The  environment 
affects  both  the  intrinsic  target  signal  and  the  apparent  signature.  The  pre¬ 
diction  schemes  must  be  easily  executed  and  simple  to  use,  and  therefore  the 
final  package,  from  the  outward  appearance,  may  not  reflect  the  true  character, 
and  sensitivities  of  the  internal  analytical  relationships  used  to  formulate 
the  prediction.  For  this  reason  the  user  must  be  familiar  with  the  structure 
of  the  procedures  and  the  potential  limitations  of  the  analytical  tools  used. 

The  performance  of  modern  weapon  systems  depends  on  the  acquisition  of 
target  location  and  appearance  data.  Weather  conditions  affect  the  appearance 
of  a  target  relative  to  its  background.  Prediction  of  this  appearance  is 
crucial  to  the  selection  of  a  weapon  system.  It  is  desirable  to  be  able  to 
predict  the  image  of  a  target  in  different  modes  of  energy  transfer  such  as 
radar,  infrared,  millimeter  and  submillimeter  waves,  and  others.  A  model  that 
predicts  the  characteristic  signatures  of  a  target  in  various  forms  of  energy 
transfer  and  in  all  weather  conditions  is  needed  to  project  system  performance. 
Some  weapon  systems  will  be  better  suited  for  a  particular  battlefield  en¬ 
vironment  than  others.  Modern  defense  systems  are  fast  and  accurate  thereby 
limiting  the  time  available  for  the  on  site  selection  of  the  appropriate  wea¬ 
pon  system. 

The  TDA  concept  is  based  on  the  idea  that  a  particular  weapon  system  can 
be  selected  which  is  best  suited  for  locating  and  destroying  a  specified  tar¬ 
get  under  expected  environmental  conditions.  Ideally  this  selection  would  be 
made  before  the  launch  of  the  weapon  system.  The  probability  of  success  of  a 
weapon  is  determined  in  part  by  environmental  factors.  These  probabilities 
are  used  to  assess  alternatives  and  to  select  an  optimum  system.  The  antici¬ 
pated  appearance  of  the  target  with  respect  to  the  background  would  be  used 
also  to  brief  a  pilot  or  for  insertion  into  the  memory  of  a  missile  computer. 

The  types  of  environmental  information  required  are:  weather,  terrain 
elevations,  soil  moisture  content,  vegetation,  soil  and  vegetation  temperature, 
position  of  the  sun  in  the  sky,  latitude  of  target,  atmospheric  dust,  smoke, 
and  smog.  The  environmental  factors  can  be  obtained  from  a  variety  of  sources. 


456 


For  instance,  weather  conditions  can  be  obtained  from  satellite  pictures,  ter¬ 
rain  elevations,  and  slopes  from  photogrammetry ,  soil  moisture,  and  the  temper¬ 
ature  of  soil  and  vegetation  from  infrared  photographs,  and  atmospheric  dust, 
smoke,  and  smog  from  measurements  of  light  scattering  and  attenuation  (2). 
However,  systematic  methods  for  obtaining  the  data  to  support  combat  opera¬ 
tions  remains  a  problem. 

A  complete  environment-target  model  would  predict  the  quality  and 
strength  of  the  target  and  background  signatures  for  different  modes  of  energy 
transfer  and  different  detectors.  These  signatures  will  vary  with  the  time  of 
day  as  well  as  with  environmental  conditions.  On  the  basis  of  this  informa¬ 
tion  an  optimal  weapon  system  can  be  selected  and  programmed  to  recognize  and 
attack  a  target  in  a  background  for  specified  weather  conditions  and  at  any 
time  of  day  or  night  (Figures  lb  and  lc). 

OBJECTIVE.  The  objective  of  this  paper  is  to  present  details  of  a  concept  for 
predicting  thermal  signatures  of  high-value  targets  and  their  backgrounds  and 
to  illustrate  the  role  of  such  a  capability  as  a  critical  component  in  a  TDA 
procedure.  The  TDA  concept  is  first  discussed  within  the  perspective  of  the 
overall  structure,  sources  of  inputs,  and  the  sophistication  of  critical  com¬ 
ponent  algorithms.  Through  this  discussion  an  effort  is  made  to  outline  the 
critical  component  of  TDA  procedures  and  to  illustrate  the  inherent  complexity 
required  to  forecast  the  performance  of  modern  weapons  systems. 

The  concepts  for  high-value  target  and  background  thermal  signature  pre¬ 
diction  is  presented  in  detail.  High  value  targets  are  defined  as  large  crit¬ 
ical  facilities  such  as  airfields,  POL  storage,  bridges,  power  plants,  and 
rail  yards.  While  the  procedures  presented  are  generic  in  nature,  the  high- 
value  target  model  described  is  being  developed  for  the  Air  Force  Armaments 
Laboratory  and  Air  Force  Geophysics  Laboratory  as  a  component  to  their  TDA 
program  (Figure  Id). 

A  HIGH  VALUE  TARGET  TDA.  A  procedure  for  projecting  the  performance  of  an  IR 
weapons  system  design  to  detect  and  lock-on  high  value  targets  would  require 
four  major  components;  a  target-background  signature  model,  a  sensor  charac¬ 
teristics  model,  an  atmospheric  transmission  model,  and  systems  models  to  tie 
together  the  basic  component  models  and  provide  a  means  to  estimate  system 
performance  for  the  scenario  described  by  the  input  data. 

The  target-background  signature  model  function  is  to  project  the  inherent 
(at  the  target)  appearance  of  the  target  within  the  background  for  specified 
weather  conditions.  If  the  weapons  system  of  interest  detects  and  locks-on  by 
a  simple  contrast  criteria  the  model  description  of  target  and  background 
geometry  may  remain  quite  simple.  The  average  radiance  of  the  target  and  the 
background  may  he  sufficient  information  to  project  performance.  If  back¬ 
ground  complexity  is  a  factor  in  sensor  performance  or  the  target  has  specific 
parts  that  are  significantly  hotter  than  the  remainder  of  the  structure  of 
facility  a  more  complex  description  of  the  target  and  background  are  required. 
This  may  require  supplementing  an  overall  average  contrast  value  with  informa¬ 
tion  on  background  complexity  (clutter  level)  and/or  target  hot  spots  that  may 
be  more  critical  to  system  performance  than  the  average  contrast  value.  If  an 
imaging  sensor  is  being  considered  the  full  geometric  complexities  of  the 
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target  and  background  must  be  included.  The  signature  simulation  then  expands 
essentially  to  a  full  scene  simulation. 

The  actual  signature  calculation  consists  of  an  energy  budget  analysis  of 
the  various  natural  background  and  target  surfaces  that  would  be  within  the 
field  of  view  of  the  sensor.  The  energy  fluxes  are  driven  by  the  character¬ 
istics  of  the  surfaces  and  the  weather.  While  one  dimensional  models  may  ade¬ 
quately  handle  surfaces  for  simple  targets  and  contrast  dependent  sensors, 
consideration  of  energy  fluxes  for  scene  simulation  requires  much  more  complex 
models  that  allow  energy  exchanges  between  target  and  background  surfaces  as 
dictated  by  their  orientation  and  proximity. 

Sensor  characteristic  models  essentially  describe  the  interaction  of  sen¬ 
sor  components  with  the  received  signals  from  the  target  and  background.  The 
sophistication  of  the  model  is  in  direct  proportion  to  the  sophistication  of 
the  sensing  device  and  any  internal  logic  used  to  detect,  classify,  and 
lock-on  to  a  target.  Once  again  a  contrast  seeker  is  more  simply  simulated 
than  an  imaging  system.  The  first  requires  only  a  signal-to-noise  analysis 
while  the  later  requires  consideration  of  the  radiance,  geometry,  texture,  and 
association  of  the  target  and  background. 

Atmospheric  transmission  models  are  needed  to  project  the  inherent  target 
and  background  signatures  estimated  at  the  location  of  the  target  to  the  sig¬ 
natures  received  at  the  position(s)  of  the  sensor  system.  For  modeling  real¬ 
istic  battlefield  conditions  it  is  necessary  to  include  the  effects  of  natural 
atmospheric  absorption  and  scattering  phenomena  as  well  as  the  very  dramatic 
impacts  of  battlefield  induced  contaminants  such  as  dust  and  smoke. 

The  systems  performance  model  integrates  the  previously  described  com¬ 
ponent  models  to  mimic  the  entire  system  from  signature  generation,  through 
atmospheric  attenuation,  to  interaction  with  the  sensor.  It  acts  as  an  execu¬ 
tive  program  for  providing  inputs  to  the  specific  component  models,  trans¬ 
ferring  one  component's  output  to  another  component  for  further  processing  and 
finally  shaping  the  output  of  the  entire  system.  While  the  systems  perfor¬ 
mance  model  is  the  glue  that  ties  the  components  together  it  is  the  complexity 
and  capabilities  of  the  component  models  that  really  dictate  the  capabilities 
of  the  overall  TDA  procedure  and  its  limitations. 

INPUTS  AND  THEIR  SOURCES.  The  individual  component  models  require  some  common 
and  some  unique  inputs.  The  sensor  characteristic  model  primarily  requires 
indicies  related  to  sensor  performance  such  as  electrical  bandwidth,  spatial 
resolution,  spectral  sensitivity,  signal  processing  transforms,  and  logic/ 
classified  criteria. 

The  atmospheric  transmission  model  requires  a  description  of  atmospheric 
aerosol  and  molecular  constituents.  Inputs  should  be  tied  closely  to  regional 
atmospheric  characteristics,  local  weather  conditions,  and  battlefield  activi¬ 
ties.  This  can  be  handled  most  easily  by  using  a  series  of  preestablished 
look-up  tables  representing  a  range  of  anticipated  atmospheric  conditions.  In 
this  manner,  an  appropriate  atmospheric  condition  is  selected  from  a  menu  and 
the  system  performance  code  provides  the  required  information  to  the  atmo¬ 
spheric  model  to  make  representative  atmospheric  transmission  calculations. 
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The  major  input  requirement  is  for  the  target-background  signature  model. 
Calculation  of  energy  fluxes  requires  a  description  of  the  thermal  and  physi¬ 
cal  characteristics  of  the  target  and  background  surfaces  of  most  significance. 
These  are  basically  static  parameter  values  that  can  be  provided  from  look-up 
tables  keyed  to  specific  surface  types  and  conditions.  Some  surface  thermal 
properties,  such  as  the  thermal  conductivity  of  soil,  are  dynamic  and  correct 
parameter  value  estimates  require  a  determination  of  the  state  of  the  surface 
for  the  time  of  signature  forecast  and  in  many  instances  some  prior  period. 

Weather  is  the  principal  driver  of  the  energy  fluxes  as  well  as  the  state 
of  the  surfaces.  As  such  it  becomes  the  most  critical  input  to  the  TDA  pro¬ 
cedure,  Wind  speed,  air  temperature,  humidity,  cloud  cover,  rainfall,  and 
solar  insolation  data  on  a  timely  basis  are  critical  to  accurate  calculation 
of  surface  IR  signatures.  Since  the  object  is  to  estimate  signatures  and 
sensor  performance  at  some  time  in  the  future,  the  weather  data  input  by  neces¬ 
sity  must  result  from  forecasts  or  extensions  of  measured  data.  As  the  time 
lapse  between  the  last  measured  weather  input  and  the  time  of  engagement  in¬ 
creases,  the  accuracy  of  the  forecast  weather  data  input  to  the  model  clearly 
diminishes.  The  TDA  procedures  must  be  designed  to  provide  any  advantage  pos¬ 
sible  to  offset  the  possible  inaccuracies  in  the  weather  inputs. 

Since  weather  data  are  critical  to  the  execution  of  a  high-value  target 
TDA  a  near-real  time  weather  data  acquisition,  reduction,  and  processing  capa¬ 
bility  is  needed  to  provide  the  procedure  with  the  best  possible  information. 
Progress  in  mesoscale  weather  models  would  add  materially  to  the  reliability 
of  such  TDA  performance  projections, 

INFRARED  TARGET  AND  ENVIRONMENT  MODEL  FOR  TDA.  A  specific  example  of  a  target- 
environment  model  that  can  be  used  in  a  TDA  program  is  the  IR  imaging  of  high 
value  targets  such  as  airfield  runways  and  industrial  complexes  such  as  petro¬ 
leum  storage  tanks.  The  predicted  image  of  a  target  in  a  background  should 
agree  with  the  image  displayed  by  an  infrared  detector  in  a  missile  or  air¬ 
plane.  This  prediction  tells  the  pilot  or  missile  computer  what  type  of  ac¬ 
tual  IR  image  to  expect  during  a  tactical  military  mission.  This  will  reduce 
the  recognition  time  for  a  target  which  may  be  camouflaged  and  not  easily  seen 
in  visible  light. 

The  IR  imaging  model  consists  of  two  basic  components:  1)  a  target  and 
background  IR  radiance  model,  and  2)  a  scene  generation  model  which  projects 
the  intrinsic  target  and  background  radiances  into  the  focal  plane  of  the 
sensor  of  the  attacking  missile  or  airplane.  The  target  and  background  radi¬ 
ances  are  calculated  using  the  computer  program  Terrain  Surface  Temperature 
Model  (TSTM) ,  and  the  infrared  scenes  are  generated  from  a  computer  program 
called  Scene  Generator  (SCNGEN)  (3,4).  The  SCNGEN  computer  program  requires 
as  input  the  target  and  background  radiances  generated  by  TSTM.  Both  compo¬ 
nent  models  require  input  data  from  the  environment  (Figures  2a  and  2b). 

BASIC  PHOTOMETRIC  DEFINITIONS.  The  spectral  radiance  of  a  target  surface  is 
defined  as  the  amount  of  power  radiated  per  unit  area,  per  unit  solid  angle, 
and  per  unit  wavelength  as  follows  (5-7) 

dP  =  cos  6  dA  d ft  d\  (1) 
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where 


P  =  emitted  power,  watts 

-2  -1  -1 

=  spectral  radiance,  watts  *  m  *  pm  *  sr 

0  =  angle  between  direction  of  observation  and  the  normal  to  the  target 
surface,  radians 

? 

A  =  area  of  target  surface,  m 
Q  =  solid  angle,  steradians  (sr) 

X  =  wavelength,  pm 

The  radiance  of  a  surface  is  defined  as 


dA, 


watts 


(2) 


and  is  associated  with  specified  wavelength  intervals.  The  target  facet  radi¬ 
ances  used  in  TSTM  are  associated  with  the  3-5  and  7-14  micron  wavebands. 


The  magnitude  of  the  IR  radiance  associated  with  the  background  and  tar¬ 
get  is  due  to  a  number  of  physical  processes  associated  with  the  absorption, 
emission,  and  reflection  of  electromagnetic  energy.  The  total  IR  radiance  as¬ 
sociated  with  target  and  background  surfaces  consists  of  a  thermal  component 
and  a  reflected  component.  The  thermal  radiance  depends  on  the  absolute  tem¬ 
perature  of  the  target  and  background  facets. 

GROUND  TEMPERATURE .  The  surface  temperature  of  the  background  is  calculated 
by  the  TSTM  (3).  Several  physical  processes  are  included  in  this  model  (fig¬ 
ure  3a): 

a.  radiative  absorption  and  thermal  emission  of  the  surface 

b.  conduction  of  heat  in  target  and  background 

c.  conduction  and  convection  by  the  atmosphere 

d.  latent  heat  of  evaporation 

The  equilibrium  surface  temperature  is  determined  by  satisfying  an  energy  flow 
budget  equation  at  the  surface  of  a  target  or  background  (3,8).  The  energy 
budget  includes  the  absorption  of  a  SW  insolation  component  and  a  LW  atmo¬ 
spheric  term.  The  ground  temperature  is  calculated  from  the  following  two 
major  irradiance  terms: 


a.  Direct  SW  Insolation  on  Ground 
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The  SW  solar  irradiance  on  the  ground  is  given  by  (3) 


H®(SW)  =  S  W.  cos  Z„ 
0  Ol  s 


(3) 


where 

H^(SW)  =  solar  irradiance  on  ground,  watts  *  m  ^ 

_2 

S  =  solar  constant  =  1395.0  watts  *  m 
o 

=  cloud  cover  function  (reference  3) 

Z  =  solar  zenith  angle 

b.  Direct  LW  Atmospheric  Irradiance  on  Ground 

The  atmospheric  LW  irradiance  on  the  ground  is  given  by  the  Brunt  equa¬ 
tion  (3) 

HftraaW)  =  W2(Ta,ea)  (4) 

where 


^atm  =  ^  irradiance  on  ground,  watts  *  m  ^ 

W9(T  ,e  )  =  air  temperature  and  water  vapor  function  (reference  3) 

4  a  u 

T^  =  air  temperature 
e^  -  water  vapor  pressure 

The  LW  irradiance  is  diffuse.  The  W~(T  ,e  )  function  contains  empirical 
coefficients.  a  a 

IRRADIANCE  ON  TARGET  FACET.  The  TSTM  was  originally  designed  to  predict  the 
ground  temperature  and  the  temperatures  of  horizontal  surfaces.  This  model 
has  been  modified  to  describe  vertical  surfaces  by  including  the  important 
radiative  interchanges  between  the  vertical  target,  the  atmosphere,  and  the 
background. 

The  radiative  transfer  at  the  target  surfaces  has  longwave  (LW)  and 
shortwave  (SW)  components  and  include  the  following  five  major  irradiance  com- 
ponents  (Figures  3b-3d) : 

a.  Direct  SW  Insolation  on  Target 

The  direct  SW  solar  irradiance  on  a  target  surface  is  given  by 

H^ar(SW)  =  SqW1  cos  0t  (5) 
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where 


far  ■  2 

Ho  (SW)  =  solar  irradiance  on  target  surface,  watts  *  m 

(j>t  =  angle  between  surface  normal  and  the  direction  of  the  sun 
A  simple  analysis  shows  that 

cos  <|>t  =  sin  Zg  sin  (SL)  cos  (S^  “  SL^)  +  cos  C0S  (6) 

where 

SL  =  slope  angle  of  target  surface 
SAZ  =  so^ar  a2imuth 

SLa^  =  azimuth  angle  of  normal  to  target  surface 

b.  SW  Solar  Radiation  Reflected  from  Background  to  Target 

The  SW  irradiance  on  the  target  surface  due  to  reflection  from  the 
ground  is  given  by 

Href,g(SW)  =  rg(SW)SoWl  cos  2s  co,s  (7) 

where 

r  (SW)  =  SW  reflected  coefficient  of  ground  =  1  -  £  (SW) 
g  6 

£  (SW)  =  SW  absorptivity  of  ground 
g 

R 

<j>t  =  angle  between  surface  normal  and  the  reflected  sun  ray 

The  SW  reflectivity  may  depend  on  the  solar  zenith  angle,  especially  for  the 
case  of  water  surfaces.  Simple  geometry  shows  that 

cos  (j)^  =  sin  Zg  sin  (SL)  cos  (S^  “  SL^)  -  cos  Zg  cos  (SL)  (8) 

c.  Direct  LW  Atmospheric  Irradiance  on  Target  Surface 

The  atmospheric  LW  irradiance  on  a  target  surface  is  assumed  to  be 
given  by  the  Brunt  equation  as  in  the  case  of  the  LW  irradiance  on  the 
ground  (equation  4) 


H^(LW)  ,  W2(Ta,ea) 
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fa  r 

where  H  t  (LW)  =  direct  LW  irradiance  on  target  surface,  watts  *  m 

d*  IW  Atmospheric  Radiation  Reflected  from  Ground  to  Target 

Because  the  atmospheric  LW  radiation  is  diffuse  in  nature  the  radiation 
reflected  from  the  ground  will  also  be  diffuse,  and  the  irradiance  on  a  raised 
target  surface  is  given  by 


Hta* 

ref.g 


(LW) 


kff- 


ref(LW) 


cos  0^  cos  0 


A 


dA  dA 

g  t 


(10) 


where  the  diffuse  reflected  LW  ground  radiance  is  given  by 


r  (LW) 

N?ef(LW)  =  A—  Hftm(LW) 


(11) 


and  where 


0 

g 


r 


target  area 

angle  between  normal  to  target  and  reflected  ray 

angle  between  normal  to  ground  and  incident  ray 

variable  distance  between  points  on  ground  and  on  target 
surface 


r  (LW) 
6 

e  (LW) 

5 


area  of  ground 

LW  reflectivity  of  ground  =  1  -  e^(LW) 

LW  absorptivity  of  ground  =  LW  emissivity  of  ground 


For  the  case  of  a  homogeneous  half-plane  background  the  integral  in  equa 
tion  10  simplifies  to  the  result 


tar 
ref  ,g 


(LW) 


=  I  Nref(LW) 


(10a) 


e*  LW  Irradiance  on  Target  Due  to  Thermal  Radiance  of  the  Ground 

The  diffuse  LW  irradiance  on  a  raised  target  due  to  the  thermal  radi¬ 
ance  of  the  ground  is  given  by 


jjtar 

ther,g 


(LW)  = 


1  Nfhe/LW)  CQ*  9t 


cos  0 
- dA 

g 


(12) 
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where  the  thermal  ground  radiance  is  obtained  from  Planck’s  law  as 

d  k 


Nfher(LW)  =  £g(LW)C 


if 


[exp  (d[/\T)  -  l] 


(13) 


where 


^ther(^)  =  t^ierraa^  tadiance  of  ground 

C*  =  1.192  x  108  watts  •  (pm)^  •  m  2  *  sr 
c!J  =  1.4338  x  104  pm  •  °K 
T  =  ground  temperature,  °K 


The  IR  wavebands  are  taken  to  be  3-5  and  7—14  microns.  For  the  case  of  a 
homogeneous  half-plane  background  the  integral  in  equation  12  simplifies  to 


utar 

ther ,g 


(LW) 


=  I  Nther(LW> 


(12a) 


The  total  SW  irradiance  on  the  target  surface  is  obtained  from  equa 
tions  5  and  7  to  be 


Htar(SW)  =  H^ar(SW)  +  H£a£  (SW) 


(14) 


The  total  LW  irradiance  on  the  target  surface  is  obtained  from  equations  9, 
10,  and  12  to  be 

Htar(LW)  =  h£(LW)  +  dgm  +  H£aerr>g(LW)  (15) 

Finally  the  total  irradiance  on  a  target  surface  is  given  by 

Htar  =  Htar(SW)  +  Htar(LW)  (16) 


which  has  a  SW  and  a  LW  contribution.  For  horizontal  surfaces  such  as  the 
ground,  the  target  irradiances  given  in  equations  14  and  15  reduce  to  those 
given  in  equations  3  and  4,  respectively. 

Less  important  radiative  transfer  effects  are: 

a.  SW  solar  radiation  reflected  from  target  surface  to  the  background 
and  reradiated  as  IR  thermal  emittance  which  irradiates  target. 
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b.  Atmospheric  IR  radiation  reflected  from  target  surface  to  the  back¬ 
ground  and  reradiated  as  a  thermal  IR  emittance  which  in  turn  irra¬ 
diates  the  target. 

TARGET  FACET  TEMPERATURE.  The  absorbed  part  of  the  total  irradiance  on  a  tar¬ 
get  surface  enters  the  calculation  of  the  target  surface  temperature.  The  ab¬ 
sorbed  power  per  unit  is  obtained  from  equations  14  and  15  as 

Habs  =  et(SW)Iltar(SW)  +  £taw)Htar(LW)  (17) 


where 


Ha^s  =  power  absorbed  per  unit  area,  watts  *  m 

£t(SW)  =  SW  absorptivity  of  target  surface 

£t(LW)  -  LW  absorptivity  of  target  surface 

It  is  this  total  absorbed  power  that  enters  the  calculation  of  target  facet 
temperature  using  the  computer  program  TSTM.  The  procedure  for  calculating 
the  target  surface  temperature  using  equation  17  is  essentially  the  same  as 
used  for  calculating  the  ground  temperature. 

For  surfaces  that  are  not  horizontal  the  orientation  of  the  surface  with 
respect  to  the  direct  and  reflected  radiation  components  enters  the  irradiance 
calculation.  This  introduces  a  dependence  of  the  predicted  target  surface 
temperature  on  the  slope  and  azimuth  angles  of  the  surface,  and  a  dependence 
on  the  solar  zenith  and  solar  azimuth  angles. 

TARGET  FACET  RADIANCE.  The  total  IR  emittance  from  a  target  or  background 
surface  is  the  sum  of  a  reflected  IR  component  and  a  thermal  IR  emittance 
which  is  calculated  from  the  predicted  target  surface  temperature. 

a.  Thermal  IR  Radiance  of  Target 


The  thermal  radiance  of  a  target  or  background  surface  is  calculated  from 
Planck’s  black  body  radiation  law  as  follows  (Figure  3d) 


Olw> 


=  et(LW)C^  f 


d\ 


A5  exp  (c|/XT)  -  l" 


(18) 


where 


^ther  =  thermal  radiance  of  target,  watts  *  m  ^  *  sr  ^ 
T  =  target  surface  temperature,  °K 
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The  IR  transmission  windows  are  taken  to  be  3-5  and  7-14  microns.  The  im¬ 
portant  parameters  for  the  determination  of  thermal  radiance  are  the  target 
surface  temperature,  the  emissivity  of  the  surface,  and  the  transmission 
wavebands . 

b.  Reflected  IR  Radiance 


In  addition  to  the  thermal  IR  radiance  of  a  surface  there  is  also  a  re¬ 
flected  IR  radiance  which  for  a  vertical  surface  is  due  in  part  to  the  IR 
atmospheric  radiation  reflected  directly  from  the  target,  and  in  part  to  the 
IR  atmospheric  radiation  reflected  from  the  background  and  then  reflected 
again  from  the  target  surface.  There  is  also  an  IR  reflected  component  due  to 
the  thermal  emittance  of  the  background  which  is  reflected  by  the  target  sur¬ 
face.  The  reflected  IR  radiance  is  calculated,  under  the  assumption  that  it 
is  diffuse,  in  the  following  manner  from  equation  15  (Figure  10) 


N 


tar 

ref 


(LW)  = 


rt(LW) 

n 


Htar(LW) 


where  rt(LW)  =  LW  reflectivity  =  1  -  et(LW)  . 


(19) 


The  total  target  facet  IR  radiance  is  obtained  finally  as  the  sum  of  the 
thermal  radiance  and  the  LW  reflected  radiance  as  follows 


Ntar(LW)  =  N^r(LW)  +  N^jkLW) 


(20) 


This  is  the  intrinsic  target  radiance  that  serves  as  input  for  the  determina¬ 
tion  of  the  apparent  radiance  seen  by  the  IR  detector, 

APPARENT  TARGET  AND  BACKGROUND  RADIANCE,  The  apparent  target  and  background 
IR  radiances  measured  in  the  focal  plane  of  an  IR  detector  in  an  aircraft  is 
less  in  magnitude  than  the  intrinsic  radiances  for  the  following  reasons: 

a.  geometrical  attenuation  according  to  the  inverse  square  of  the  dis¬ 
tance  between  target  and  detector 

b.  atmospheric  absorption  and  scattering 

c.  obscuration  due  to  smoke,  clouds,  dust,  etc. 

d.  orientation  of  the  missile  relative  to  the  ground  plan 

The  synthetic  IR  imaging  of  a  target  in  a  background  will  be  accomplished  by 
the  SCNGEN  computer  program  (4) . 

The  terrain  background  is  represented  by  several  textures  such  as  trees, 
grass,  roads,  etc.  In  order  to  construct  a  scene  the  target  must  be  located 
within  the  background,  and  the  detector  position,  viewing  direction,  and 
orientation  must  be  specified.  The  angular  field  of  view  of  the  detector  de¬ 
termines  the  footprint  on  the  ground  plane  which  is  centered  about  the  pierce 
point  of  the  line-of-sight  between  sensor  and  ground  plane. 
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The  target  surface  is  approximated  by  planar  triangular  facets.  Each 
facet  is  assigned  a  value  of  intrinsic  radiance  by  the  TSTM  computer  program. 
As  discussed  earlier  in  this  paper  the  intrinsic  radiance  values  depend  on  the 
orientation  of  the  facets  relative  to  the  sun.  Facets  not  visible  from  the 
sensor's  location  and  orientation  are  systematically  eliminated  by  SCNGEN.  If 
more  than  one  target  is  considered,  additional  targets  are  implanted  so  that 
occlusion  of  one  target  by  another  is  accomplished.  The  effects  of  transmis¬ 
sion  of  the  IR  image  through  the  atmosphere  are  handled  by  a  version  of  the 
LOWTRAN  4  computer  program. 

NUMERICAL  RESULTS.  Numerical  calculations  have  been  done  for  the  surface  tem¬ 
peratures  and  intrinsic  radiances  of  the  facets  of  a  petroleum  storage  tank 
(Figure  4a).  A  petroleum  storage  tank  is  constructed  of  steel  plates  whose 
surfaces  are  painted  or  rusted  to  some  degree.  There  is  sometimes  an  airspace 
over  the  petroleum,  and  heat  conduction  in  both  petroleum  and  air  was  con¬ 
sidered.  Table  4  gives  the  thermal  conductivities,  diffusivities,  and  spe¬ 
cific  heats  of  air,  petroleum*  and  soil,  and  also  gives  the  LW  emissivities 
and  SW  absorptivities  of  the  surface  coatings  considered  for  this  numerical 
study  (3,9). 

Figure  4b  shows  a  typical  diurnal  variation  of  soil  temperature  and  air 
temperature.  Figure  4c  shows  horizontal  facet  temperatures  for  the  top  of  a 
petroleum  storage  tank,  and  Figures  4d  and  4e  give  the  diurnal  variation  of 
vertical  facet  temperature  including  the  dependence  on  the  azimuth  angle  of 
the  facets.  Figure  5a  shows  a  typical  diurnal  variation  of  the  radiance  of  a 
soil  background,  while  Figure  5b  gives  the  radiance  of  a  horizontal  facet  of  a 
tank.  The  diurnal  variation  of  the  vertical  facet  radiances  and  their  de¬ 
pendence  on  azimuth  are  given  in  Figures  5c  and  5d.  Figures  5e  through  6b 
give  examples  of  the  expected  intrinsic  radiance  contrast  for  the  facets  of  a 
petroleum  storage  tank. 

Figures  4e  through  6b  show  that  the  predicted  surface  temperatures  ad¬ 
jacent  to  the  airspace  and  to  the  petroleum  are  nearly  the  same.  However, 
the  difference  in  the  values  of  the  thermal  inertia  (density  times  specific 
heat)  of  air  and  petroleum  suggests  that  a  large  temperature  difference  and 
phase  lag  should  exist  between  the  air  and  petroleum  temperatures.  Prelim¬ 
inary  experimental  results  suggest  that  this  is  the  case.  The  TSTM  computer 
program  is  totally  insensitive  to  the  thermal  diffusivity  (thermal  inertia) 
of  the  working  material  (3).  This  represents  an  unphysical  prediction  of  the 
TSTM  computer  program  and  is  the  reason  for  the  nearly  identical  thermal 
response  of  air  and  petroleum  presented  in  this  paper. 

The  calculation  of  apparent  target-background  radiance,  as  would  be  seen 
by  an  aircraft  IR  sensor,  is  in  progress  using  the  SCNGEN  computer  program. 
When  completed  an  experimental  validation  program  will  be  carried  out  by  com¬ 
paring  the  predicted  apparent  radiances  (IR  scene)  of  a  petroleum  storage  tank 
with  the  digitized  experimental  values  of  the  apparent  radiances  as  given  in 
the  Target  and  Background  Information  Library  System  (TABILS)  database  (Fig¬ 
ure  6c) . 

CONCLUSION.  An  environmental  model  and  an  IR  image  formation  model  have  been 
developed  which  are  based  on  the  use  of  physical  principles  to  describe  the 
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basic  processes  involved  in  the  exchange  of  energy  between  the  target  and  the 
environment.  Changing  weather  conditions  affect  the  energy  exchange  and  alter 
the  appearance  of  a  target  in  a  background.  The  degree  of  contrast  of  the  IR 
image  of  a  target  in  a  background  will  vary  with  the  hour  of  day,  azimuthal 
angle  of  attack,  target  surface  conditions,  and  local  weather  conditions. 

This  degree  of  contrast  can  be  used  to  predict  the  performance  of  alternate  IR 
sensor  weapon  systems  against  a  specified  target  and  background  (Figure  6d) . 
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TABLE  1 


Army  Electro-Optical  Systems 


ARMY  E-0  DEPENDENT  SYSTEMS 
(OPTICAL) 


SYSTEM 

WAVEBAND.  Mm 

MIGHT  VISION  GOGGLM 

14-1S 

INDIVIDUAL  served  WEAPON  SIGHT 

14-11 

MBS/tt,  MJWEt  PemsCOPM 

SMI 

CHEW  SERVED  WEAPON  SIGHT 

14-1# 

HUMAN  EVE 

14-17 

T  '  to  BINOCULARS 

14-0.7 

DRAGON  OAT  SIGHT 

14-1? 

TOW  DAY  BIGHT 

14-47 

GUO  OAT  «OHT 

14-17 

— 

TAM  DAY  5IOMT 

14-17 

M-1  DAY  SIGHT 

14.17 

ITS  DAY  BIGHT 

14-17 

ARMY  E-0  DEPENDENT  SYSTEMS 

PERFORMANCE  OF 

(INFRARED) 

VISUAL  AND  INFRARED 
DETECTORS  -  WEST  GERMANY 

SYSTEM 

WAVEBAND,  am 

TOW  NIGHT  SIGHT 

7-j-ti 

TANK  TARGET  AT  Z7S0-M  RANGE 

QLLD  NIGHT  SIGHT 

7.5-12 

AND  CLOUD  CEILING  600-M 

NODLR  NIGHT  BIGHT 

7-S-12 

DRAGON  NIGHT  SIGHT 

7.5-12 

frequency 

OF  FAILURE 

TANK  THERMAL  SIGHT 

7.S-12 

DETECTOR  JAN  JUL 

THERMAL  IMAGING  SUBSYSTEM 

7.4-12 

NIGHT  CHAPARRAL  FUR  SUBSYSTEM 

7.S-12 

VISUAL  1  SB  0-64 

TARGET  ACQUISITION  DESIGNATION  SIGHT 

7,5-12 

INFRARED  Ml  0.13 

HANDHELD  THERMAL  VIEWER 

14- 15 

MILLIMETRE  7  7 

a. 

b. 

Thermodynamic  end  Radiometric 
Properties  of  Materials 


Thermodynamic  Properties* 


Material 

Thermal 
Conductivity 
(cal/cm-min*  K) 

Thermal 
Diffusivity 
(cm2 /min) 

Specific 
Heat.  C 
(cal/sm-OK 

Soil 

0.20 

0.3 

0.3 

Asphalt  Pavement 

0-106 

0.28 

0.3 

Water 

0.084 

0.084 

1.0 

Petroleum 

0.02 

0.033 

0,6 

Air 

0.0036 

12.9 

0.243 

Radiometric  Properties* 


WEST  GERMANY  CLOUD  COVER 


PERCENT  FREQUENCY 
CLOUD  COVER  SUMMER  WINTER 


CLEAR  TO  SCATTERED 
0-30*4 

30 

10 

BROKEN  TO  OVERCAST 
70-100* 

50 

70-80 

Material 

Rust 

White  paint 
Aluminum  paint 
Soil 

Asphalt  pavement 


Shortwave 

Absorptivity 

0.  9  A 
0.20 
0.20 
0.90 
0.90 


Longwave 

Emissivity 

0.90 

0.91 

0.40 

0.70 

0.96 


*  These  properties  may  vary  considerably  with  composition 
and  weather  conditions. 


c. 


d. 
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ENVIRONMENT 

1.  BACKGROUND  -  SOIL,  GRASS,  TREES, 

ROADS,  TOWNS,  CITIES 

2.  ATMOSPHERE  -  CLOUDS,  HUMIDITY, 

WIND  SPEED,  AIR  TEMPERATURE, 
SMOKE ,  DUST 

3.  CLIMATE  -  INSOLATION,  LATITUDE, 

SOLAR  ZENITH  AND  AZIMUTH  ANGLES 


a. 


WEAPON  SYSTEM  SENSORS 

t. 

RADIO  WAVES 

2. 

RADAR  (MICROWAVE) 

3. 

MILLIMETER  AND  SUBMILLIUETER 

4. 

INFRARED 

S. 

VISUAL 

a. 

ACOUSTIC 

7. 

SEISMIC 

b. 


HIGH  VALUE  TARGETS 

1.  TYPE 

INDUSTRIAL  COMPLEXES,  AIRFIELD  RUNWAYS, 
MISSILE  SHELTERS,  BUILDINGS,  DAMS ,  BRIDGES 

2.  TARGET  CHARACTERISTICS 
SURFACE  GEOMETRY  AND  ORIENTATION 
SURFACE  EMISSIVITY 

SURFACE  REFLECTIVITY 
SURFACE  TEMPERATURE 

MULTIPLE  TARGETS  -  SHADOWS  AND 
INTERACTIVE  EXCHANGE 


C. 


d. 


FIGURE  1 


SHORT  WAVE  INSOLATION 


b. 

FIGURE  2 
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REFLECTED  W  fMDMMCE 


TARGET  SURFACE  TEMPERATURE 


a. 


b. 


c.  d. 


e. 


FIGURE  5 
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TARGET  AND  BACKGROUND 

SIGNATURES  (SCENES) 

TOA 

EXPECTED  TARGET 

1.  INFRARED 

WHICH  WEAPON  SYSTEU 

APPEARANCE  FOR 

2.  RADAR 

HAS  BEST  TARGET- 

OPTIMAL 

3.  UIL  LINE  TER  AND 

SUB  MIL  UUE  TEH  WAVES 

BACKGROUND  CONTRAST 7 

WEAPON  SYSTEM 

4.  VISUAL 

1.  WEATHER 

I.  PILOT  BRIEFING 

S.  ELECTRO-ACOUSTIC 

a.  t me  of  day 

2.  MISSILE  COMPUTER 

6.  ELECTRO-SEISMIC 

c. 
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ORTHOGONAL  SCHEMES  FOR  STRUCTURAL 
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Abstract,  Historically  there  are  two  principal  methods 
of  matrix  structural  analysis,  the  displacement (or  stiffness) 
method  and  the  force  (or  flexibility)  method.  In  recent  times 
the  force  method  has  been  used  relatively  little  because  the 
displacement  method  has  been  deemed  easier  to  implement  on 
digital  computers,  especially  for  large  sparse  systems.  The 
force  method  has  theoretical  advantages,  however,  for  multiple 
redesign  problems  or  nonlinear  elastic  analysis  because  it 
allows  the  solution  of  modified  problems  without  restarting  the 
computation  from  the  beginning.  In  this  paper  we  give  an 
implementation  of  the  first  phase  of  the  force  method  which  is 
numerically  stable  and  preserves  sparsity.  A  primary  feature 
of  our  work  is  the  development  of  an  efficient  algorithm  for 
computing  a  banded  basis  for  the  null  space  by  orthogonal  de¬ 
composition.  Numerical  test  comparisons  for  several  practical 
structural  analysis  problems  are  provided. 

Key  Words :  structural  optimization,  force  method, 
orthogonal  factorization,  Givens  rotations,  tumback-QR  method. 


1.  Introduction.  Given  the  external  loads  on  a  structure,  the  object  of 
structural  analysis  is  to  determine  the  resulting  internal  forces,  stresses,  and 
displacements.  The  solution  to  this  problem  is  provided  by  a  variational  principle 
(minimization  of  energy)  subject  to  the  linear  elastic  relationships  among  the 
nodes  and  elements  of  the  finite  element  model  of  the  structure.  Either  the  forces 
or  the  displacements  may  be  taken  as  the  primary  quantities  to  be  computed, 
and  the  other  can  then  be  determined  as  a  by-product.  These  two  approaches  give 
rise  to  the  force  (or  flexibility)  method  and  the  displacement  (or  stiffness) 
method,  respectively.  In  recent  times  the  displacement  method  has  predominated, 
largely  because  it  is  easier  to  implement  on  digital  computers,  especially  for 
large  sparse  systems,  and  makes  use  of  well  established  techniques  of  numerical 
linear  algebra.  The  displacement  method  can  be  inefficient,  however,  for 
structural  optimization  problems  in  which  a  sequence  of  related  structural 
analysis  problems  must  be  solved  (e.g.,  problems  having  a  fixed  layout  but 
differing  material  properties).  The  force  method  is  then  preferable  because  it 
utilizes  a  portion  of  earlier  computations  in  order  to  solve  such  modified  problems 
without  starting  the  computations  over  from  the  beginning.  Unfortunately,  most 
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implementations  of  the  force  method  have  suffered  from  excessive  fill  or  numerical 
difficulties,  or  both.  In  this  paper  we  give  an  implementation  of  the  first  phase 
of  the  force  method  which  is  numerically  stable  and  preserves  sparsity  for  large- 
scale  problems.  The  complete  implementation  will  be  published  elsewhere  [1  ]. 

Before  stating  our  problem  precisely,  we  need  to  develop  some  notation.  The 
notational  conventions  used  by  structural  engineers  and  by  numerical  analysts  are 
quite  different.  As  a  compromise,  we  will  use  the  same  letters  to  denote  various 
quantities  as  are  commonly  used  by  structural  engineers  (see,  e.g.,  [3]),  but 
we  will  retain  the  usual  convention  in  numerical  analysis  that  lower  case  letters 
represent  vectors,  while  upper  case  letters  represent  matrices.  Our  notation  is 
summarized  in  Table  1. 


equilibrium  matrix  :  E 
element  flexibility  matrix  :  D 
element  stiffness  matrix  :  D~] 
system  stiffness  matrix  ;  K—ED~l ET 
self-stress  matrix  :  B 
system  flexibility  matrix  :  F=*BTDB 
nodal  load  vector  :  p 
system  force  vector  :  / 
nodal  displacement  vector  :  r 
system  displacement  vector  :  v 
redundant  force  vector  :  x 


Table  I 


The  known,  defining  quantities  for  a  structural  analysis  problem  are  the 
equilibrium  matrix  E,  the  element  flexibility  matrix  D  (or  equivalently  the  element 
stiffness  matrix  D”^) ,  and  the  nodal  load  vector  p.  The  unknowns  to  be  determined 
are  the  system  force  vector  f,  the  system  displacement  vector  v,  and  the  nodal 
displacement  vector  r.  The  remaining  variables  in  Table  1,  such  as  the  redundant 
force  vector  x,  are  derived,  intermediate  quantities.  We  assume  that  E  is  an 
m  *  n  matrix  of  rank  m  and  that  D  is  a  symmetric  positive  definite  matrix  of 
order  n.  Here  D  is  block  diagonal  with  the  diagonal  blocks  having  orders  ranging 
from  one  to  six,  depending  on  the  finite  element  model  of  the  structure.  An 
example  of  a  two-dimensional  frame  and  its  equilibrium  matrix  are  shown  in 
Figs.  1  and  2. 


hio  I.  Twv-iUmtiutuaai  from*  *uh  titnum  ami  mttk  mumtxnag 


Fm  2.  Equilibrium  matrix  £  far  i+o-dlm*mkmml  frame 


The  linear  elastic  relationships  for  the  system  are 

1,  Equilibrium  Equation: 

Ef  -  p 

2 ,  Compatibility  Equation: 

T 

E  r  -  v 

3 ,  Material  Equation^ 

Of  -  v. 

The  equilibrium  and  compatibility  equations  represent  constraints  on  the  forces  and 
displacements,  respectively,  and  the  material  equation  represents  the  material 
characteristics. 
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The  fundamental  problem  of  structural  analysis  is  that  of  solving  the 
quadradic  programming  problem 

m^n  ^fTDf  subject  to  Ef  =  P*  (1.1) 

The  force  method  is  motivated  by  the  observation  that  problem  (1.1)  is  asking 
for  a  weighted,  minimum-norm  solution  to  the  equilibrium  equation.  Given  any 
particular  solution  s  to  such  an  under determined  system,  any  other  solution  can  be 
expressed  as  a  sum  of  s  and  a  solution  to  the  corresponding  homogeneous  system 
Ef  -  0.  Thus  we  may  write  the  system  force  vector  f  as 


f  =  s  +  Bx, 


(1.2) 


where  the  n  x  (n  -  m)  matrix  B,  called  the  self-stress  matrix,  is  chosen  so  that 
its  columns  form  a  basis  for  the  null  space  of  E.  Once  s  and  B  have  been  determined, 
we  then  need  to  determine  the  proper  linear  combination  of  the  columns  of  B, 
represented  by  the  vector  x,  so  that  f  solves  problem  (1.1).  The  force  methods  is 
usually  carried  out  in  two  phases: 

Phase  1:  Compute  a  particular  solution  s  of  the  equilibrium  equation,  together 
with  a  self-stress  matrix  B  such  that  EB  =  0. 

Phase  2:  Compute  the  redundant  force  vector  x  and  the  system  force  vector  f 
from  (1.2) . 

Since  Phase  1  depends  only  on  E  and  p,  it  need  be  executed  only  once  in  order  to  solve 
a  sequence  of  problems  which  differ  only  in  D,  with  just  Phase  2  being  repreated  for 
each  new  value  of  D.  In  order  for  the  force  method  to  be  viable  in  practice,  however, 
we  need  numerically  stable  algorithms  for  both  phases  which  preserve  sparsity. 

Phase  1  is  the  subject  of  this  report.  Phase  2  and  its  combination  with  Phase  1  will 
be  considered  elsewhere  [1].  In  the  following  section  we  describe  a  refined  method 
for  computing  a  banded  basis  matrix  B  for  the  null-space  of  E. 

2 .  Band  Schemes . 


For  a  general  sparse  equilibrium  matrix  E  it  is  difficult  to  say  anything  about 
the  structure  of  the  resulting  self-stress  matrix  B  other  than  that  it  might  be  quite 
full.  If  E  has  a  banded  structure,  however,  Topcu  [4]  and  Kaneko,  Lawo  and  Thierauf  [2] 
have  shown  how  by  elimination  methods  to  compute  a  B  which  also  is  banded.  By 
modifying  and  extending  their  algorithm,  we  will  show  how  by  orthogonal  methods  to 
produce  a  banded  self-stress  matrix  B.  Since  E  and  B  are  not  square  matrices,  perhaps 
we  should  first  point  out  that  we  do  not  mean  banded  with  respect  to  the  usual  main 
diagonal,  but  with  respect  to  a  line  from  the  upper  left  comer  to  the  lower  right 
corner  of  the  matrix.  Also,  for  our  purposes  the  distinction  between  band  and  profile 
(also  known  as  variable  band,  envelope,  or  skyline)  implementations  is  unimportant, 
and  so  we  will  use  the  shorter  term  "band"  to  mean  both  possibilities. 

We  now  give  a  refined  turnback  scheme,  developed  originally  by  Topcu  [4]  and  by 
Kaneko,  Lawo  and  Thierauf  [2],  for  computing  a  banded  basis  matrix  B  for  the  null-space 
of  a  banded  matrix  A. 

Given :  An  m  *  n  banded  matrix  A,  with  full  row  rank  m. 

Purpose:  To  compute  an  n  x  p,  p  *  n  -  m,  banded  matrix  B,  with  a  small  profile, 
whose  columns  form  a  basis  of  the  null  space  of  A;  that  is  AB  =  0, 
rank  B  =»  p  . 

Notation:  A^  ,  B^  denote  the  j1"*1  columns  of  A  and  B  respectively. 
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Seep  1:  Construct  a  sequence  b  <  b*  <  ...  <  b  .  where  b,  denotes  the  row  index 

}  P  J 

for  the  last  nonzero  entry  in  column  B  of  such  a  B,  for  j  *  1,  p  . 

Eat  ^enote  the  first  m  columns  of  A.  We  determine  the  sequence  of  column 
indices  c^  <  c.^  <  ...  <  cg#  if  any,  where  column  dependencies  (’’zero’1  pivots)  occur 

when  is  reduced  to  upper  triangular  form.  Here  either  Gaussian  elimination  with 

row  pivoting  (or  orthogonal  reduction  by  Givens  rotations)  is  applied  to  A  ,  In 
particular,  define,  for  t  •  1,  . s,  m 


ct  ^ 


min  k 


1  <  k  <  m  and 


{»n> . *<k,]\[»<ii> . *<vi>] 


is  a  linearly  dependent  set. 


(Here 


A 


<ci> 


(c 


A 


-]  ■ 


is  missing  if  t  «  1.) 


Now  we  define  the  row  indices  b.  as  follows.  For  j  =  1,  p, 


b 


J 


c^  if  1  _<  j  <_$ 
m  +  j  otherwise 


Step  2:  (Turnback  step).  For  j  =  1 , 

(1)  Set 


P> 


consturct  the  column  B 


(j) 


as  follows: 


1  if  b.  <  m 

J  ~ 


b.  -  in  if  b  >  m 
J  J 


and  define  a  matrix  of  columns  from  A  as  follows: 


(b.)  (b.-l) 

E.  =  [E  J  I  E  J 
J  1 


,(t) 


where  the  columns  of  E_.  are  columns  of  A,  beginning  with  the  b . th  column  E 

in  decreasing  column  subscript  order.  More  specifically  the  columns  of  E.  are  the  columr 
vectors  j 


GO  (b.) 


J  , 


=  A 
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[*‘V . “]\[iltl> . ***' 

in  decreasing  column  subscript  order,  where 

if  j  =  1,  where  the  t?s  are  given  by  (2.1)  below. 


[/'» . >*’}.. 


(2)  Apply  either  Gaussian  elimination  with  row  pivoting  or  orthogonal 
reduction  by  Givens  rotations  to  ,  producing  an  upper  trapezoidal  matrix 


Ej  -  [E 


.00 


where 


t.  =  max 
J 


At£) 


(bJ 


k  <  b .  and 
3 


,  ....  E 


(t 


'*] 


is  a  linearly  dependent  set. 

A 

Then  necessarily  E,  is  ra  x  (b.  -  t.)  and  has  the  form 
3  3  3 


E .  = 
3 


U  E 

0  0 


n 

s  J 
) 


where  U  is  square  of  order  b.  -  t.  -  1,  upper  triangular  and  nonsingular 

3  3 

(the  zero  blocks  may  be  missing). 


(3)  Solve  the  triangular  system 
*(t4) 


Uy  =  -E 


3 


by  back  substitution  for  y-(y),l<s<b.-t.  -1, 

s’—  '33 


(4)  Define  the  column  "(B  ^)  of  B  as  follows 

<V  a) 

1  if  E  J  is  column  Av  ;  of  A 


a)  =  l 


y  if  is  column  A^  of  A,  1<  s  <  b.  -  t.  -1 

s  “  -  J  3 


0  otherwise. 


a 


missing 


(2.1) 
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3.  Numerical  Comparisons  for  Phase  1 


3 , 1  Structural  Test  Data: 

In  addition  to  the  two  dimensional  frame  example  described  in  section  1,  five 
other  structures  provided  a  means  for  preliminary  testing  of  the  orthogonal  turn¬ 
back  algorithm.  Some  preliminary  comparisons  were  made  between  the  orthogonal 
and  the  Gaussian  elimination  implementations  of  the  algorithm*  Each  of  the  examples 
described  below  was  kindly  provided  by  M.  Lawo  of  Essen,  Germany* 

Example  2 : 

Three  Dimensional  Frame. 

The  finite  element  model  has  16  nodes  and  24  elements  and  E  is  72  x  144. 


Example  3 : 

Plane  Stress  Problem  (Slab) . 

The  finite  element  model  has  25  nodes  and  16  elements  and  E  is  40  x  80. 
Example  4: 

Plate  Bending  Problem. 

The  finite  element  model  has  25  nodes  and  16  elements  and  E  is  59  *  144. 


Example  5 : 

Plane  Frame  Problem  (Wheel) . 

The  finite  element  model  has  32  nodes  and  40  elements  and  E  is  96  x  120. 


Example  6: 

Plane  Stress  Problem  (Wrench) . 

The  finite  element  model  has  57  nodes  and  48  elements  and  E  is  112  *  216.  A 
distinguishing  feature  here  is  a  "hole”  in  the  model.  Also,  three  different 
node  and  element  orderings  were  tested  for  this  model:  block  angular,  minimal 
bandwidth  and  minimal  skyline. 
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3.2  Numerical  Comparisons .  Some  preliminary  comparisons  were  performed 
using  each  of  the  six  structural  analysis  problems  just  described.  In  each 
example,  the  banded  null-space  basis  matrix  Z  for  the  equilibrium  matrix  was 
computed  by  CD  the  turnback  algorithm  implemented  with  Gaussian  elimination 
and  (2)  the  turnback  algorithm  implemented  with  orthogonal  transformations  by 
Givens  rotations.  The  basic  turnback  algorithm  in  our  refined  form  was  given 
in  Section  2. 

All  the  software  for  this  project  was  written  at  North  Carolina  State 
University  in  FORTRAN  and  executed  on  the  large-scale  IBM  3081  at  the 
Triangle  Universities  Computing  Center  (TUCC).  Each  problem  was  tested  in 
IBM  double  precision.  IBM  single  precision  was  also  used  for  certain  of 
the  problems  in  order  to  obtain  a  better  basis  for  comparison.  Various 
tolerances  were  tested  for . declaring  a  floating  point  zero  pivot,  i.e.,  a 
rank  dependency  among  the  columns  of  E.  We  settled  on  the  tolerance  10-6 
since  the  nonzero  elements  of  E  generally  had  magnitude  approximately  1  the 

data  was  given  to  six  digits.  This  tolerance  worked  quite  well  In  the  computations. 

We  now  give  some  observations  on  the  performances  of  our  software  on  these 
examples.  In  order  to  obtain  some  measure  of  the  accuracy  of  a  given  method 
for  computing  a  basis  matrix  B  for  the  null-space  of  the  equilibrium  matrix  E, 
the  Frobenius  norm,  | |EB| |f  was  calculated.  In  what  follows,  B  will  denote  a 

basis  matrix  computed  by  the  back  substitution  method  and  Z  will  denote  a  basis 
matrix  obtained  by  the  refined  turnback  algorithm. 


1.  In  each  case  ||eb||f  was  smaller  than  ||EZ||f*  In  fact,  the  first  norm  was 


2ero  for  the  frame  examples.  Examples  1  and  2.  Very  little  difference  in 
accuracy  was  observed  between  the  Gaussian  elimination  and  orthogonal  factori¬ 
zation  implementations  of  the  refined  turnback  method.  Essentially, 


-6 

10  in  single  precision 
in  double  precision 


2.  Surprisingly,  the  orthogonal  transformation  implementation  of  the  refined 
turnback  algorithm  executed  faster  than  the  Gaussian  elimination  implementation. 
We  suspect  that  this  situation  is  primarily  due  to  the  fact  that  row  pivoting 
is  used  with  Gaussian  elimination  while  no  pivoting  is  necessary  for  stability 
with  Givens  rotations.  Such  pivoting  can  consume  an  inordinate  amount  of  time 
in  the  lengthy  turnback  phase  of  the  algorithm. 

The  computations  we  have  performed  thus  far  are  preliminary  in  nature,  but 
on  the  basis  of  the  results  discussed  above,  the  orthogonal  factorization 
implementation  of  the  refined  turnback  algorithm  appears  to  be  an  attractive 
method  for  computing  a  banded  basis  for  the  null-space. 

Further  refinements  of  these  methods  followed  by  numerical  tests  will  be 
made  in  the  near  future  under  grants  from  other  funding  agencies . 
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ABSTRACT 


This  study  presents  the  results  of  a  three  dimensional  elastic- 
plastic  dynamic  stress  analysis  of  one  of  the  structural  joints  encoun¬ 
tered  in  artillery  projectiles.  The  particular  spline  joint  analyzed 
has  equally  spaced  set  screws  around  the  surface  of  the  projectile.  In 
general,  these  types  of  structural  joint  problems  are  variable  contact 
problems  in  that  the  interaction  between  the  set  screws  and  their  bearing 
surface  along  with  the  interaction  between  the  interfaces  of  the  joint 
are  nonlinear  in  nature.  Due  to  the  complexity  of  the  structural 
configuration  and  loadings  of  the  joint,  the  finite  element  method  has 
been  used  to  solve  the  problem.  The  numerical  analysis  covers  the  time 
from  initial  launching  to  barrel  exit.  Stresses  and  deformations  in  the 
joint  are  determined  at  various  stages  of  loading.  The  effect  of  the  set 
screws  and  set  screw  holes  on  the  stress  distributions  in  the  joint  is 
examined  in  detail. 

INTRODUCTION : 

Artillery  projectiles  are  subjected  to  extremely  high  loads  during 
firing.  At  present,  the  design  of  artillery  projectiles  is  greatly 
facilitated  by  the  exploitation  of  the  finite  element  method.  But  the 
application  of  the  method  has  been  limited  to  the  simplified  two  dimen¬ 
sional  or  axisymmetric  analysis.  This  is  due  to  the  complexity  of  the 
geometries  of  the  projectiles  thus  requiring  very  long  computer  analysis. 
Although  successes  were  achieved  in  the  previous  designs,  there  have 
always  been  concerns  of  the  structural  integrity  of  the  projectiles  with 
the  presence  of  high  local  stresses.  In  the  case  of  XM753  projectiles, 
high  local  stresses  appear  in  the  region  of  the  pinned  joint.  In  the 
case  of  XM735  projectiles  the  use  of  spline  joint  with  set  screws  in  the 
design  also  leads  to  high  local  stresses  in  the  joint  area. 

After  firing,  an  artillery  projectile  is  subjected  to  various  loads 
continuously  changing  with  time.  It  experiences  first  a  very  high 
compressive  load  in  the  axial  direction  during  in  bore  flight  and  then  a 
high  tensile  load  at  barrel  exit.  Such  a  loading  history  causes  the 
projectile  to  undergo  a  stress  reversal  i.e.  from  a  stress  state  of 
compression  to  that  of  tension.  However  it  has  been  a  common  practice  in 
the  design  of  artillery  projectiles  to  perform  either  one  or  two  simple 
independent  stress  analysis.  The  one  which  is  generally  carried  out  is 
a  two  dimensional  quasi-static  analysis  in  which  the  loads  at  the  peak 
linear  acceleration  during  in  bore  flight  are  used  to  compute  the 
stresses  and  deformations  in  the  projectile. 
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Sometimes  a  two  dimensional  dynamic  analysis  is  also  conducted  to 
determine  the  stresses  and  deformations  in  the  projectile  caused  by 
the  sudden  drop  off  of  pressure  of  the  propellant  gas  at  barrel  exit. 

In  the  present  investigation  a  detailed  three  dimensional  elastic- 
plastic  dynamic  stress  analysis  of  a  spline  joint  of  XM785  projectile 
is  performed.  The  purposes  of  the  study  are:. 

(1)  to  verify  the  structural  adequacy  of  a  proposed  design. 

(2)  to  asses  the  effect  of  set  screws  and  set  screw  holes  on  the 
stress  distributions  in  the  joint  region  of  the  projectile. 

In  addition  the  present  work  also  serves  as  an  initial  effort  to 
determine  the  extent  of  the  influence  of  the  stress  reversal  or 
Bauschinger  effect  on  the  structural  integrity  of  the  projectile. 

DESCRIPTION  OF  LOADING  CONDITIONS 


After  a  projectile  is  fired  and  before  it  departs  the  gun  barrel 
it  is  subjected  to  a  combination  of  the  following  loads: 

(1)  axial  compressive  load  due  to  linear  acceleration  of  the 
pro j  ect i 1 e 

(2)  centrifugal  load  due  to  angular  rotation  of  the  projectile. 

(3)  torsional  load  due  to  angular  acceleration  of  the  projectile. 

(4j  internal  load  due  to  interaction  of  interior  components  and 
proj  ectile. 

(5)  external  load  due  to  gun  tube  constraint,  rotating  band  pressure 
and  balloting. 

As  the  projectile  departs  the  gun  barrel,  it  experiences  a  tensile  load 
or  a  negative  set-back  load  (elastic  release)  in  the  axial  direction 
resulting  from  the  sudden  drop  off  of  propellant  pressure  at  the  barrel 
exit.  Among  the  loads  the  axial  load  is  the  dominating  one. 

It  has  been  a  general  practice  to  omit  the  effect  of  the  torsional 
load  induced  by  the  angular  acceleration  in  the  design  of  the  projectiles. 
Analysis  has  shown  that  as  a  result  of  such  an  omission,  the  magnitude  of 
the  effective  stresses  in  the  projectiles  are  about  2%  -  4%  lower  (1). 
Since  the  present  investigation  concerns  the  determination  of  the  stress 
distributions  in  the  region  of  the  joint,  only  the  portion  of  the 
projectile  in  the  neighborhood  of  the  joint  is  considered.  The  area  where 
the  rotating  band  is  located  is  excluded.  Thus  the  load  due  to  rotating 
band  pressure  is  not  included  in  the  analysis.  Only  the  axial,  centrifugal 
and  internal  loads  during  in  bore  flight  and  the  negative  set  back  load  at 
barrel  exit  are  considered  in  the  three  dimensional  dynamic  stress 
analysis  of  the  joint.  Fig.  1  shows  the  linear  acceleration  of  the 
projectile  used  in  this  analysis  for  the  calculation  of  the  axial  load. 

The  projectile  reaches  an  acceleration  of  I7,000g  in  about  6  milliseconds, 
zero  acceleration  at  barrel  exit  and  immediately  is  subjected  to  a 
negative  acceleration  of  2000g. 
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METHOD  OF  STRESS  ANALYSIS 


Figs.  2  and  3  shows  respectively  the  geometric  configuration  of 
an  artillery  projectile  and  a  typical  section  of  its  joint.  The  latter 
is  formed  by  passing  a  plane  at  the  midpoint  of  two  neighboring  set 
screws  and  another  plane  through  the  center  of  one  of  the  set  screws. 

The  two  planes  are  parallel  to  the  axis  of  the  projectile. 

In  the  initial  phase  of  the  analysis,  the  finite  element  model  of 
the  joint  (see  Fig.  4)  was  created  by  using  the  computer  program  PATRAN-G 
(2).  It  employs  sophisticated  interactive  color  graphics  and  a  powerful 
geomet ry-based  language  for  geometry  construction  and  finite  element 
modeling  during  the  first  phase  of  the  program.  Then  nodal  point  and 
finite  element  generation,  assignment  of  physical  properties,  application 
of  external  loads  and  definition  of  sliding  interfaces  are  accomplished 
in  the  second  phase  of  the  program. 

The  explicit  three  dimensional  finite  element  code  DYNA3D  (3)  was 
used  to  compute  the  stresses  and  deformations  in  the  finite  element 
model  of  the  joint.  DYNA3D  is  designed  to  analyze  the  large  deformation 
dynamic  response  of  inelastic  solid s.  It  has  a  contact  algorithm  that 
can  model  gaps  and  sliding  materials  interfaces.  It  uses  a  8-node  constant 
stress  solid  element  and  one  point  integration  in  element  stiffness 
calculations.  It  is  programmed  to  take  full  advantage  of  vector 
optimization  on  the  CRAY-1  (a  class  VI  machine)  and  can  execute  at  less 
than  0.67CPU  (central  processor  Units)  minutes  per  million  mes'h  cycles. 

A  symmetric,  penalty  based,  CDnt act  -  imp  act  algorithm  was  implemented 
that  not  only  reduced  hourglassing  problems,  but  also  was  considerably 
faster  in  execution  speed  and  exceedingly  reliable. 

There  were  1773  eight  node  brick  elements  and  2611  nodes  in  the 
finite  element  model  of  the  joint.  Four  sets  of  gap  or  sliding  interfaces 
were  required  in  the  model.  Three  sets  of  sliding  interfaces  were 
used  to  model  the  contact  between  the  case  structure  and  the  wedge.  A 
sliding  interface  with  small  initial  gap  was  used  to  model  the  interaction 
between  the  set  screw  and  the  hole. 

The  computations  were  performed  on  a  CRAY-1  computer  at  Sandia 
National  Laboratories.  The  computer  time  required  to  complete  the  dynamic 
analysis  was  about  2,8  CPU  hours. 

DISCUSSION  OF  RESULTS 


The  post  processor  program  GRAPE  (4)  was  used  to  obtain  plots  of 
stress  contours  on  the  surfaces  of  the  joint  and  the  deformed  shapes  of  the 
joint  at  the  time  when  the  maximum  linear  acceleration  is  reached  and 
also  at  barrel  exit.  These  are  shown  in  Figs.  5  to  11.  Examination  of 
the  results  of  the  analysis  leads  to  the  following  findings:' 

(1)  Due  to  the  presence  of  set  screws,  a  larger  portion  of 

the  axial  compressive  load  is  transmitted,  during  in-bore 
flight,  through  the  center  sections  of  the  regions  bounded 
by  each  pair  of  set  screws.  Consequently,  the  center 
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sections  experience  higher  stresses.  This  is  different 
from  the  uniform  stress  distribution  in  the  circumferential 
direction  found  in  the  two  dimensional  stress  analysis . 

2.  Higher  stresses  occur  on  the  outer  surfaces  of  the  case 
structure  and  the  wedge  at  both  peak  axial  compression 
and  tension  at  barrel  exit  due  to  bending  effect. 

3.  Local  yielding  occurs  in  the  set  screws,  set  screw  holes 
and  other  areas  in  the  case  structure  and  the  wedge  at 
time  of  peak  axial  compression  and  that  of  tension  at 
barrel  exit.-  Initial  examination  of  the  plastic  strains 
(or  Bauschinger  effect)  at  peak  axial  compression  leads 
one  to  believe  that  they  have  a  negligible  effect  on  the 
level  of  stresses  in  the  joint  at  barrel  exit.  This  is 
because  their  magnitudes  are  small  (.001)  and  their 
locations  are  also  different  from  the  plastic  strains 
induced  by  the  tensile  load  or  negative  set-back  at 
barrel  exit. 

4.  Similar  stress  distributions  were  obtained  when  a  wedge 
made  of  either  steel  or  titanium  was  used  with  a  titanium 
case  structure . 

In  summary,  a  detailed  three  dimensional  elastic-plastic  dynamic 
stress  analysis  of  an  artillery  projectile  was  obtained  to  determine 
the  adequacy  of  a  proposed  design.  The  analysis  was  performed  by 
using  the  DYNA3D  finite  element  program.  The  results  of  the  present 
analysis  will  be  verified  by  laboratory  test  currently  under  preparation 
at  AMMRC.  Future  analytical  work  will  include  application  of  fracture 
mechanics  to  the  problem  and  accurate  determination  of  the  Bauschinger 
effect  by  using  actual  stress  strain  curves  of  the  materials.  The 
present  analysis  employed  isotropic  strain  hardening  which  does  not 
accurately  represent  the  material  behavior  in  a  loading  condition  where 
there  is  a  reversal  of  applied  loads. 
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See  Table  1  for  Values  of  Symbols 
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Fig.  5  Axial  Stresses  in  the  Case  Structure 

Case  1:  At  time  of  maximum  compression 
Case  2:  At  time  of  barrel  exit 
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Fig.  6  Effective  Stresses  in  the  Case  Structur 
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Fig.  7  Effective  Plastic  Strains  in  the  Case  Structure 
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Fig.  9  Effective  Stresses  in  the  Wedge 
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Fig.  11  Deformed  Shape  of  Finite  Element  Model 
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Levels  Of  Stresses  and  Strains 
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A  simple  approach  for  determination  of 


BURSTING  PRESSURE  OF  A  THICK-WALLED  CYLINDER 

SHIH  C.  CHU 
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FIRE  CONTROL  &  SMALL  CALIBER  WEAPON  SYSTEMS  LABORATORY 
U.S,  ARMY  ARMAMENT  RESEARCH  AND  DEVELOPMENT  COMMAND,  DOVER,  NJ  07801 

ABSTRACT 

By  using  the  true-stress  true-strain  relation,  a  simple  finite-strain  incom¬ 
pressible  analytical  solution  technique  is  developed  to  predict  pressure- 

deformation  relations  up  to  the  failure  of  thick-walled  cylinders  subjected  to 
internal  pressure.  The  material  is  assumed  to  be  an  isotropic  hardening  material 
that  obeys  the  von  Mises  yield  condition.  The  flow  law  for  the  material  incor¬ 
porates  the  Prandtl-Reuss  stress-strain  relations  and  a  loading  function  repre¬ 
sented  by  the  true-stress  versus  true-strain  diagram*  Poisson's  ratio  is  assumed 
to  be  equal  to  one-half  for  both  elastic  and  plastic  strains  1  The  bursting  pres¬ 
sure  for  thick-walled  cylinders  made  of  either  SAE  1045  steel  or  copper  is 

presented  graphically  for  various  ratios  of  outer  and  inner  radii  of  undeformed 

cylinders.  The  true  location  of  inner  surface  of  cylinders  made  of  both  metals 
at  various  pressure  levels  was  determined  and  presented  graphically.  Pressure- 
expansion  curves  for  cylinders  made  of  SAE  1045  steel  and  OFHC  copper  are 

presented. 
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NOMENCLATURE 


true-stress  components 
true-strain  components 

effecive  true  stress  and  effective  true  strain 

inner  and  outer  radii  of  the  undeformed  thick-walled  cylinder 

inner  and  outer  radii  of  the  deformed  thick-walled  cylinder 

variable  radius  of  deformed  thick-walled  cylinder 

variable  radius  of  undeformed  thick-walled  cylinder 

r-rQ  is  the  radial  displacement 

internal  and  external  pressures 

Young’s  modulus 

Poisson’s  ratio 

cr0]L  is  yield  stress 

stress  at  intersection  of  equations  3  and  4 

aj E  is  yield  strain 

ratio  of  se  to  tQ  at  r-r1 

ratio  of  ez  to  £e(r:nr  ) 

r  0  2/ r  0 1  radius  ratio 

ultimate  tensile  stress 

bursting  pressure 


500 


INTRODUCTION 


Thick-walled  cylinders,  such  as  gun  tubes  and  pipes  to  hydraulic  presses, 
often  are  required  to  resist  internal  pressures  that  will  cause  inelastic 
strains.  It  is  important  for  the  engineer  to  have  a  relatively  reliable  and 
simple  solution  technique  that  can  be  used  to  predict  the  load-deformation  rela¬ 
tions  for  thick-walled  cylinders  subjected  to  internal  pressure  up  to  fracture 
load  encountered#  This  information  would  enable  the  engineer  to  determine  the 
factor  of  safety  of  his  design  against  failure  by  fracture.  In  the  case  of  auto- 
fret  tage,  it  would  enable  the  engineer  to  determine  safe  pressures  to  be  used  or 
to  determine  safe  limits  for  the  deformations.  Partially  plastic,  thick-walled 
cylinders  have  been  investigated  by  many  investigators  [1-6],  In  all  the  solu¬ 
tions  [1-6],  it  has  been  assumed  that  subsequent  to  the  initial  yield  the  elastic 
plastic  interface  is  cylindrical  and  concentric  with  the  bore.  This  assumption 
is  correct  in  many  cases  but  not  correct  for  a  material  with  a  large  drop  of 
stress  at  yield.  In  addition  most  of  the  solutions  are  based  on  the  total  strain 
theory,  though  Taylor  has  pointed  out  that  the  incremental  theory  should  be  used. 
However,  the  total  strain  theory  gives  a  solution  which  is  only  slightly  in  error 
in  this  particular  problem.  The  governing  equations  have  been  derived  for  par¬ 
tial  yield  thick-walled  cylinders,  assuming  the  dimensional  changes  are  negligi¬ 
ble,  but  in  the  region  of  the  bursting  pressure  the  strains  are  considerably 
large  and  this  assumption  is  no  longer  valid. 

Based  on  the  closed-ended  cylinder  condition,  bursting  pressure  was  Investi¬ 
gated  by  Manning  and  Chem  [7]  and  Crossland  and  Bones  [8],  In  their  approach  the 
plastic  strain  in  the  axial  direction  is  assumed  to  be  zero.  Experimental  data 
for  closed-ended  cylinders  [9]  indicates  that  the  assumption  is  reasonable. 
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Some  empirical  equations  for  bursting  pressure  of  thick-walled  cylinders  have 
been  proposed  by  Faupel  and  Furbeck  [10]  and  Iterson  [11]#  Those  empirical  equa¬ 
tions  are  derived  either  based  on  experimental  data  or  on  intuition*  The  well- 
known  mean-diameter  formula  is 


P 


u 


2f 

u  Ti?ry 


Cl) 


A  finite  total-strain  incompressible  analytical  solution  is  presented  for 
thick-walled  cylinders  subjected  to  internal  pressure*  Using  the  strain  calcula¬ 
tions,  the  geometry  of  the  deformed  cylinder  is  calculated  at  the  end  of  each 
load  increment.  Since  the  deformed  geometry  of  the  cylinder  is  used  in  making 
the  calculations  of  the  strain  increments  for  each  load  increment,  the  computed 
strains  are  true  strains  and  the  solution  is  yalid  for  finite  strain.  One  might 
question  the  advisability  of  considering  a  solution  not  readily  acceptable  to 
experts  in  the  field  of  plasticity  when  an  incremental  compressible  solution  is 
available.  The  total-strain  solution  was  used  because  it  was  inexpensive  to  use 
and  the  computer  program  did  not  have  convergence  problems.  On  the  other  hand, 
the  computer  program  for  the  incremental  solution  would  not  converge  unless  the 
plastic-strain  increment  for  each  increment  of  load  was  small.  In  order  to 
obtain  a  compressible  incremental  solution  it  was  necessary  to  make  the  load 
increments  smaller  and  smaller  as  the  plastic  strains  increased  in  magnitude. 
Because  of  the  convergence  problems  and  the  excessive  computer  cost  for  imple¬ 
menting  the  compressible  incremental  theory,  the  present  solution  technique  is 
presented. 
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GOVERNING  EQUATIONS 


In  this  investigation,  the  material  is  assumed  to  be  an  isotropic  hardening 
material  that  obeys  the  von  Mises  yield  condition.  Poisson's  ratio  is  assumed  to 
be  equal  to  one-half  for  both  elastic  and  plastic  strains. 

Cylindrical  coordinates  are  used  with  z  being  the  axial  direction  and  r  and  0 
being  the  radial  and  circumferential  directions,  respectively.  The  nonzero 
stress  and  strain  components  are  <?r,  <jQ>  az »  tQz,  ef ,  eQ,  ez  and,  For  the 
thick-walled  cylinder  subjected  to  pressure,  the  effective  stress  aG  and 
effective  strain  ee  are  defined  by  the  relations 


V2  +  (Vaz)2  +  (az'°r)2 


(2) 


VT 


-  3 


\/(VEe)2  +  (Ve*)2  +  <Ez"Er)2 


(3) 


The  flow  law  for  the  material  incorporates  the  Prandtl-Reuss  stress-strain 
relations  and  a  loading  function  represented  by  a  tension  true-stress  versus 
true-strain  diagram.  The  loading  function  for  the  analytical  solutions  is 
obtained  from  a  tension  specimen  whose  effective  true-stress  ( a^)  versus  effec¬ 
tive  true-strain  (ee)  diagram  is  approximated  by  a  finite  number  of  straight 
lines.  The  straight  line  through  the  origin  is  given  by  the  relation 

ae  *  Eee  (4) 
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where  E  is  Young’s  modulus*  All  of  the  straight  lines  are  given  by  the  relation 

°e  “  (1-mi)ooi  +  miEse  (5) 

where  aoi  is  the  stress  at  the  intersection  of  the  two  straight  lines  given  by 
eqs.  (4)  and  (5)  and  m^E  is  the  slope  of  the  straight  line  given  by  eq.  (5). 

The  condition  of  incompressibility  for  the  cylinder  is  given  by 

^  +  £0  +  ez  =  0  (6) 


The  equation  of  equilibrium  for  the  thick-walled  cylinder  is  given  by  the 
relation 


VCr 


(7) 


where  r  is  the  variable  radius  for  the  deformed  cylinder.  A  point  at  radius  rQ 
in  the  undeformed  cylinder  has  a  radial  displacement  u  during  loading  of  the 
cylinder  so  that  r  =  rQ  +  u.  The  true  magnitude  of  the  radial  and  circumferen¬ 
tial  strains  for  incompressible  conditions  are  given  by  the  relations 

(8) 

(9) 

The  compatibility  condition  for  the  thick-walled  cylinder  can  be  obtained  by 
taking  the  derivative  of  eq.  (9)  with  respect  to  rQ 
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r  e 
o 


e8  ^9  =  da  _  u_ 

dr  '  di  r 
o  oo 


(10) 


Equation  (10)  can  be  simplified  by  using  eqs,  (6),  (8),  and  (9)  to  give 


deQ  e-2e9 

r  —2  =  -  -1 

0  drrt 

o  z 


(id 


where  e  is  independent  of  rQ  and  is  assumed  to  be  known. 


based  on  Hencky's  stress-strain  relations: 


Ve8  =  Vfz  _  =  H 

araQ  aQ~ay.  ’  ay.~aT  ‘  2ae 


(12) 


From  eq.  (12)  one  obtains 


£z  =  TT  (Var)  +  er 
e 


(13) 


By  using  Che  condition  of  incompressibility ,  eqs.  (6)  and  (13),  one  can  find 


£r=-T  ltJr'7  U9+£rz)] 


(14) 


Similarly, 


ee  “  [V  T  (tW] 

e 


(15) 


Zz=T 

e 


(16) 
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The  problem  considered  here  is  a  thick-walled  cylinder  subjected  to  internal 
pressure  only.  The  computer  solution  for  the  total-strain,  Incompressible  anal¬ 
ytical  solution  is  obtained  by  specifying  the  deformations  for  the  thick-walled 
cylinder  and  calculating  the  pressure.  This  is  done  by  specifying  values  for  the 
true  strains  and  at  the  inner  radius  of  the  deformed  cylinder.  The  effec¬ 
tive  true  strain  is  specified  as 


Ke. 


(17) 


where  -  a^/E  and  aQi  is  the  value  of  in  eq.  (5)  when  i  =  1,  aQ2  is 

assumed  to  be  the  elastic  limit  of  the  material.  The  axial  strain  ^  is  given  by 
specifying  values  for  (3  in  the  relation 

/ 

ez  =  gKs0  (18) 


The  solution  of  the  differential  equation,  eq.  (11),  can  be  found  in  the  form 


4  *  4 


+ 

e  r 
e  z 


(19) 


where  C  is  the  constant  of  integration  which  is  obtained  from  the  fact  that 
eQ=eQl  ls  known  when  r  *  r^  (rQ  =  r01)»  thus 


solving  for  C 


(20) 


(21) 
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Substituting  C  into  eq*  (19)  gives 


!  ~  r  01  +  r  l®Ez 

_  - - - — 

r  2e  Ez 
o 


If  £q  is  eliminated  by  using  eq.  (6) 


Er  --i*  hpsr  +  ^]  “  pKec 

e1^  o  o 


The  effective  strain  may  be  obtained  from  eq.  (3).  By  eliminating  z c,  one  has 


Ee  %7=- 


e92  +  e0Ez  +  Ez2  =  4  Ee2 


At  the  inner  surface,  ez  and  are  specified;  hence,  one  can  solve  for  =  Sq^. 
By  using  eq.  (20),  can  be  determined. 

The  stress  distributions  in  the  deformed  thick-walled  cylinder  can  be  deter¬ 
mined  as  follows.  Substituting  eq.  (18)  into  eq*  (16)  one  obtains 

(26) 

e 

Equation  (2)  and  eq*  (26)  reduces  to 


2  %/ , 


which  can  be  substituted  into  the  equation  of  equilibrium,  eq.  (7),  to  give 
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(28) 


Before  eq.  (28)  can  be  integrated,  it  is  necessary  that  ae  and  ee  be  expressed  as 
functions  of  r*  This  can  be  carried  out  by  the  following  computational  pro¬ 
cedure, 

COMPUTATIONAL  PROCEDURE 

The  undeformed  thick-walled  cylinder  is  divided  into  N  rings  of  equal  thick¬ 
ness,  For  each  specified  value  (monotone  increase)  of  (3  and  K,  the  following 
computation  will  be  performed: 

Step  1,  Calculate  e^r-r^)  and  ez>  by  using  eqs.  (17)  and  (18). 

Step  2.  Solve  eq^,  by  substituting  and  ez  into  eq.  (25). 

Step  3.  Calculate  r^  by  using  eq.  (20). 

Step  4.  Calculate  C,  by  using  eq.  (21). 

Step  5*  Calculate  ert>  e_>  and  e_  at  each  ring  stations,  by  using  eqs.  (22), 

"  b  z  x 

(18),  and  (23),  respectively. 

Step  6.  The  deformed  position  of  each  ring  can  readily  be  determined  by 
using  the  following  relation 

r  =  rQe£0  (29) 
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Step  7.  The  effective  strain  at  each  ring  station  in  the  deformed  cylin¬ 
der  can  be  determined  by  using  eq.  (3), 

Step  8.  The  loading  function  represented  by  eqs,  (4)  and  (5)  is  used  to 
calculate  the  effective  stress  a&  at  each  ring  station. 

Step  9.  Starting  with  the  known  radial  stress  a  =  -Po  at  the  outer 

-  r  2 

radius,  the  radial  stress  is  obtained  by  numerical  integration 
of  eq*  (28). 

Step  10.  Substituting  ar  into  eq.  (27)  Oq  can  be  calculated. 

Step  11.  Calculate  az  by  substituting  <jq  and  into  eq.  (26). 

Step  12.  For  each  specified  value  of  p,  the  loads  on  the  thick-walled  cylin¬ 
der  are  calculated  for  increasing  values  of  K.  Calculations  are 
stopped  when  the  current  value  of  internal  pressure  is  smaller  than 
the  value  of  internal  pressure  for  the  previous  specified  value  of 
K.  The  bursting  pressure  is  defined  as  the  internal  pressure  P  * 

COMPUTATIONAL  RESULTS 

Based  on  experimental  testing  data,  true-stress  versus  true-strain  relations 
for  the  two  metals,  steel  and  copper,  are  shown  in  Fig.  1.  The  plastic  portion 
of  the  true-stress  versus  true-strian  diagram  is  approximated  by  five  straight 
lines  for  the  SAE  1045  steel  and  by  seven  straight  lines  for  OFHC  copper.  Each 
straight  line  is  represented  by  eq.(5);  values  of  a0  and  m  for  each  straight  line 
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are  listed  in  Table  1.  Each  true-stress  versus  true-s train  diagram  can  be 
approximated  by  a  finite  number  of  straight  lines  with  extreme  accuracy.  If  the 
straight  lines  are  chosen  so  that  the  actual  diagram  lies  above  and  below  the 
staight-line  segment  approximation  with  the  difference  no  more  than  one  percent 
(1%),  the  error  introduced  by  the  apprxomation  will  be  a  fraction  of  1%. 


Table  1. 

Values 

Stress- 

of  m  and  c0  for 
-Strain  Diagram 

Straight  Lines 

Approximating  Plastic  Portion  of 

Straight 

Line 

ksi 

*0 

MPa 

m 

SAE  1045  Steel 

1 

43*4 

299 

0.05083 

2 

54.0 

372 

0.02858 

3 

80.0 

552 

0.00847 

4 

95.0 

655 

0.00309 

5 

111.0 

' 

765 

0.00128 

0FHC  Copper 

1 

2.50 

17.2 

0.17125 

2 

3.25 

22.4 

0.07063 

3 

4.00 

27.6 

0.03125 

4 

5.37 

37.0 

0.01991 

5 

8.40 

57.9 

0.01313 

6 

21.0 

144.8 

0.00450 

7 

39.0 

269.9 

0.00078 
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The  bursting  pressures  for  thick-walled  cylinders  made  of  SAE  1045  steel  and 
OFHC  copper  for  various  radius  ratios  (p=  1.25,  1.5,  .  .  3.75  and  4.00)  were 
calculated  based  on  the  computational  procedure  proposed  in  the  previous  section. 
The  resulting  bursting  pressure  is  plotted  against  the  radius  ratio  as  shown  in 
Figs.  2  and  3. 

The  pressure-expansion  curves  for  cylinders  made  of  SAE  1045  steel  and  OFHC 
copper  are  shown  in  Figs.  4  and  5,  respectively.  Each  curve  is  stopped  when  the 
bursting  pressure  is  encountered. 

The  displacements  of  the  inner  surface  for  thick-walled  cylinders  made  of  SAE 
1045  steel  and  OFHC  copper  were  calculated  by  using  eq,  (20)  and  are  shown  in 
Figs.  6  and  7,  respectively. 


CONCLUSIONS 


By  using  the  true-stress  true-strain  relation  and  finite-strain  approach,  an 
Incompressible  solution  is  developed  to  predict  the  pressure-deformation  rela¬ 
tions  up  to  the  failure  of  thick-walled  cylinders  subjected  to  internal  pressure. 
The  proposed  numerical  computation  procedure  Is  simple  and  very  effective.  The 
total  computation  times  (execution  time)  for  12  thick-walled  cylinders  (p  ®  1.25, 
1.5,  3.75,  4,00)  made  of  SAE  1045  steel  and  OFHC  copper  are  2.871  and  14.195 
CP  seconds,  respectively. 

The  bursting  pressures  and  pressure-expansion  curves  for  thick-walled  cylin¬ 
ders  with  various  radius  ratios  were  presented  graphically.  This  information  can 
readily  be  used  by  the  designer. 
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0.1  0.2  0.3  0.4 

TRUE  STRAIN 

Fig,  1.  True-stress  vs,  true-strain  diagrams  for  SAE  1045  steel  and  OFHC  copper 


TING  PRESSURE 


P 

Fig.  3.  Bursting  pressure  of  thick-walled  cylinders  made  of  OFHC  copper 
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INTERNAL  PRESSURE 


MPa  KSI 


Fig.  5.  Pressure-expansion  curves  of  cylinders  made  of  OFHC  copper  at  outer  surface 
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INTERNAL  PRESSURE 
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LOCATION  OF  INNER  SURFACE 


Fig.  7.  Displacement  of  outer  surface  of  thick-walled  cylinders  made  of  OHlC  copper 
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STRESS  DISTRIBUTION  IN  A  CYLINDRICAL  BAR 
SUBJECTED  TO  CYCLIC  TORSIONAL  LOADING 
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ABSTRACT .  The  modified  version  of  the  endochronic  theory  of  plasticity 
is  applied  to  the  problem  of  a  cylindrical  bar  subjected  to  cyclic  fully- 
reversed  torsional  loading.  The  governing  equations  of  integral  form  are 
derived  for  pure  shear  deformation.  Analytical  techniques  are  employed  in  the 
solution  of  these  equations.  A  material  with  appreciable  cyclic  hardening 
behavior  is  studied.  The  shear  stress-strain  curve  of  such  a  material  under  a 
strain-controlled  condition  is  presented  for  all  cycles  until  a  steady  loop  is 
reached.  The  stress  distributions  in  a  cylindrical  bar  at  different  stages  of 
loading  and  unloading  are  calculated.  Some  numerical  results  are  presented. 

I.  INTRODUCTION.  In  recent  years  much  research  has  been  devoted  to 
developing  realistic  constitutive  equations  to  describe  complex  material 
behavior  such  as  cyclic  plasticity  11-9].  The  majority  of  these  works  are 
along  the  lines  of  the  classical  theory  of  plasticity,  however,  some  attempts 
have  also  been  made  in  developing  new  and  independent  theories.  The 
endochronic  theory  developed  by  Valanis  [8]  is  based  on  the  notion  of 
intrinsic  time  and  thermodynamic  theory  of  internal  variables.  The  original 
definition  of  intrinsic  time  has  led  to  difficulties  in  cases  where  the 
history  of  deformation  involves  unloading.  Valanis  [9]  has  since  introduced  a 
new  concept  of  intrinsic  time  to  overcome  these  difficulties.  The  new  theory 
has  been  successfully  applied  to  describe  the  cyclic  hardening  phenomenon 
under  uniaxial  loading  [10]. 

In  this  paper  the  modified  version  of  the  endochronic  theory  of 
plasticity  is  applied  to  the  problem  of  a  cylindrical  bar  subjected  to  cyclic 
fully— reversed  torsional  loading.  The  governing  equations  of  integral  form 
are  presented  for  pure  shear  deformation.  Analytical  techniques  are  employed 
in  the  solution  of  these  equations.  The  stress  distributions  in  a  cylindrical 
bar  at  different  stages  of  loading,  unloading,  and  reloading  are  calculated. 
The  solution  for  this  problem  based  on  a  numerical  iterative  technique  was 
reported  recently  [11]*  A  different  approach  and  additional  numerical  results 
are  to  be  presented  here. 

II .  ENDOCHRONIC  THEORY .  According  to  the  modified  version  of  the 
endochronic  theory  of  plasticity  [9],  the  intrinsic  time  for  the  one¬ 
dimensional  case  is  defined  by 
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where  tq  is  a  positive  scalar  such  that  0  <  kj  <  1,  t  and  y  are  shear  stress 
and  strain  respectively,  and  y0  is  the  shear  modulus.  When  lq  *  0,  the 
original  form  of  intrinsic  time  is  recovered;  but  when  lq  =  1,  ft  -  y  -  (t/u0) 
is  the  plastic  strain,  and 


d?  -  |dfl| 


(2) 


Equation  (2)  is  used  throughout  this  investigation  so  that  the  concept  of 
plastic  strain  may  be  incorporated  into  the  present  development. 

In  the  case  of  pure  shear,  the  governing  constitutive  equation  of 
integral  form  is  given  by 

.z  dft 

t  =*  p0  /  p(z-z’)  — -  dz*  (3) 

o  dz ’ 

where 


y0p(z)  -  y0p0  6(z)  +  qe"012  +  V2  (4) 

in  which  p0,  ot,  ui,  and  y 2  are  material  parameters,  5(z)  is  the  delta 
function. 

A  more  general  form  of  the  intrinsic  time  measure  involving  the  strain 
rate  effect  is  proposed  in  [10]  as 

dc  =*  k(|ft|)  |dft|  (5) 

where  k,  the  strain  rate  sensitivity  function,  is  a  function  of  ft.  In  this 
case,  Eq.  (3)  can  be  written  as 

,z  Ids 

t  =  y0J  p(z-z’)[±  -]  -  dz’  (6) 

o  k  dz’ 


The  constitutive  Eq.  (6)  is  applied  in  this  paper  to  describe  the  material 
behavior  at  constant  plastic  strain  rate.  The  intrinsic  time  z  is  related  to 
S  by  the  following  time  scale 


dS 

dz 


fCO 


(7) 


where  f(s)  describes  isotropic  hardening  and  is,  therefore,  termed  the 
hardening  function.  In  this  paper,  the  form 

f(z)  a  c  -  (c-l)e"&2  (8) 

will  be  used  because  of  its  simplicity  and  its  proved  usefulness  in  cases  of 
cyclic  loading  [10].  The  parameters  c  and  8  are  material  constants. 


III.  CYCLIC  SHEAR  RESPONSE.  By  using  the  hardening  function  given  in 
Eq.  (8),  the  constitutive  equation  for  loading  with  constant  plastic  strain 
rate  is  written  as 

x  =  (|i0/k)/Z  p(z-z’)f(z')<Jz’  (9) 

o 

where  p(z)  is  defined  by  Eq.  (4)  and  k  is  a  constant.  Integrating  Eq.  (9), 
the  following  explicit  result  Is  obtained: 

t  -  (ty/k)f(z)  +  (ui/k)(g(z)-g(o))  +  (u2/k)h(z)  (10) 


where 


Ty  =  u0p0  is  the  yield  stress  (11) 

h(z)  =■  cz  +  0"l(c-l)(e"Sz-l)  (12) 

g(z),  g(o)  are  the  values  of  the  function  g(z*)  evaluated  at  z’  *  z,  o, 
respectively,  and 

g(z')  =  (c/a)e‘“(z-2')  -  (c-l)/(a-0)e-az+(“-6)z'  (13) 

Equation  (10)  is  now  the  response  function  for  loading*  If  unloading  occurs 
when  the  plastic  strain  reaches  ^i  (or  z  =  zi) ,  then  the  response  function  for 
z  >  zi  can  be  obtained  as 

i  *  ~("Cy/k)f(z)  +  (ui/k)  l-g(z)  +  2g(zi)  -  g(o)]  +  (u2/k)[“h(z)  +  2h(zi>]  (14) 

If  Eqs.  (10)  and  (14)  are  examined  at  z  B  zi~  and  zi+,  respectively,  a  drop  in 
stress  of  2(iy/k)f(zi)  results  during  elastic  unloading. 

If  reloading  takes  place  after  unloading  and,  assuming  that  reloading 
occurs  at  ft  =  (or  z  -  Z2,  where  Z2  >  zi) ,  the  response  function  for  z  >  Z2 
is  obtained  as 

t  -  (Ty/k)f(z)  +  (ui/k)[g(z)  -  2g(z2)  +  2g(zi)  -  g(o)] 

+  “  2h(z2)  +  2h( z i ) ]  (15) 

When  Eqs.  (14)  and  (15)  are  examined  at  z  =  Z2~  and  Z2+,  respectively,  a  jump 
in  stress  of  2(iy/k)f(z2)  is  again  obtained  during  elastic  reloading. 

This  procedure  of  obtaining  a  theoretical  expression  for  each  part  of  the 
cyclic  loading  process  may  be  continued.  Thus,  a  general  expression  of  the 
response  function  for  the  constant  total  strain  amplitude  cyclic  torsion  test 
may  be  found  to  be 

t  »  (-l)N(ty/k)f(z)  +  (ui/k)G(z)  -I-  (u2/k)ft(z)  (16) 
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where 


N 

G(z)  =  C-l)N[g(z)-g(zN)]  +  l  (-l)n+1[g(zn)  -g(zti-l)]  (17) 

n*l 

N 

SUz)  -  (-l)N[h(z)  -h(zN)  +  l  (-l)n+1[h(zn)  -  h(zn-x)  (18) 

n=*l 

Note  that  N  *  0  is  the  number  of  half-cycles,  odd  for  unloading  and  even  for 
reloading. 

After  many  cycles  when  the  values  of  z  become  large,  the  hysteresis  loop 
of  the  stress -strain  curve  will  approach  a  steady  state.  If  Eq,  (16)  is 
examined  at  z  =  zjf* and  z  *  zdj+*  a  drop  or  jump  in  stress  of  magnitude 
2(Xy/k)f(zfl)  results,  which  corresponds  to  the  elastic  response  upon  the 
reversal  of  the  loading  or  unloading  direction.  When  z^  is  sufficiently 
large,  the  jump  or  drop  of  stress  becomes  a  constant  value  2c  ty/k  at  the 
steady  state. 

IV.  CYLINDRICAL  BAR  UNDER  TORSION.  For  a  cylindrical  bar  under 
torsional  loading,  the  external  torque  is 


Tg  »  2tt/  tr2dr 


where  t  is  the  current  shear  stress  corresponding  to  location  r,  and  ra  is  the 
radius  of  cross-section.  Geometric  considerations  show  that  radial  lines  have 
to  remain  straight  after  deformation.  Thus,  one  concludes  that 

Y  -  <Ya/ra>r  (20) 

where  Ya  is  the  strain  at  the  outermost  fiber.  Since  a  yield  stress  is 
introduced  in  Eq.  (4),  an  elastic  core  always  exists  during  deformation  whose 
radius  re  is  given  by 

Ty  ra 

r.  - -  (21) 

Wo  Ya 

If  the  experiment  is  strain-controlled  with  strain  at  ra  varying  between 
-Ya  and  +ya,  and  with  Ya  in  the  plastic  range,  then  the  torque  can  be  computed 
as 

TT  Ya 

Ts  =  -  Ty  ra3  +  2ir(ra/Ya)3/  TY2dr  (22) 

2  Ye 


Ye  *  (Ya/ra)re  »  Y  -  &  +  x/y0  (23) 

t  and  ft  are  given  by  Eqs.  (16)  and  (18),  respectively. 
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Now  that  Y  is  known,  the  values  of  z  and  x  at  each  fiber  can  be 
calculated  and  used  in  Eq.  (22).  Note  that  there  exists  an  explicit 
expression  for  all  cycles 

1 

dy/dz  =■  ii'(z)  +  —  xf(z)  ■  p(z) 

Wo 


where 


fl'(z)  -  (-1)N  f(z)  ,  f'(z>  -  B(c-l)e-&2 


t'(z)  »  (-l)N[Ty/k)£'(z)  +  (M2/k)f(z)]  +  (i»i/k)r'(z) 

N 

r'(z)  -  (-1)n[F'zz  -  F'nzI  -  I  (-l)"(F'nz  '  F' (n-l)zl 

n=l 

F’zz  “  [B(c-l)/(a-S)]e-e2  ,  F'nz  =  -aFnz 

and 

Fnz  =■  -  a-«(z-zn)  -  — )e”az  +  (t»-B)z 

a  oc —  3 


The  integral  in  Eq*  (22)  can  be  replaced  by 


/  xy2dy  =  /  xy2p(z)dz 

Ye  o 


(24) 


(25) 


(26) 


where  za  can  be  calculated  by  Eq.  (23)  with  known  value  Ya.  Now  the  numerical 
integration  becomes  very  easy  because  the  values  of  the  integrand  can  be 
evaluated  directly. 

V.  NUMERICAL  RESULTS  AND  DISCUSSION.  To  apply  the  developed  model  the 
material  constants  (a,  @Y  "c ’Vy,  W0,  Ul,  U2)  in  the  theory  have  to  be 
determined.  These  material  constants  can  be  determined  if  the  cyclic  shear 
stress-strain  curve  for  the  material  has  been  obtained  experimentally*  The 
usual  procedure  is  to  perform  a  cyclic  torsion  test  using  a  thin-walled 
tubular  specimen.  Such  a  test  for  the  annealed  AISI  4142  steel  was  carried 
out  by  the  Plasticity  Research  Laboratory  at  the  University  of  Iowa,  The 
values  of  constants  were  then  used  to  predict  the  results  for  a  solid  bar 
test.  The  theoretical  and  experimental  results  were  in  reasonable  agreement 
[11].  This  real  material  does  not  show  any  appreciable  amount  of  cyclic 
hardening • 

For  purpose  of  investigating  the  implications  of  the  developed  model,  a 
hypothetical  material  with  appreciable  cyclic  hardening  behavior  was  studied. 
The  shear  stress-strain  behavior  of  such  material  under  fully-reversed 
torsional  loading  is  presented  in  Figure  1.  The  material  constants  were 
determined  as:  a  -  1000,  0  ■  50,  c  -  1.5,  xy  *  104  psi,  u0  =  107  psi,  ui  = 
4x10^  psi,  P2  33  0-  A  3teady  loop  is  established  after  a  few  cycles.  The  same 
set  of  constants  were  then  used  to  predict  the  stress  distribution  in  a 
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cylindrical  bar  subjected  to  cyclic  torsional  loading.  We  have  carried  out 
the  computational  process  seven  cycles  after  initial  loading-  The  numerical 
results  are  presented  here  for  the  initial  loading  half-cycle  and  two  cycles 
of  unloading  and  reloading. 

Figure  2  presents  the  numerical  results  for  the  torque  as  related  to  the 
shear  strain  at  the  outermost  fiber-  The  distribution  of  stress  in  the 
cross-section  at  different  magnitudes  of  torque  during  the  initial  loading 
half-cycle  is  presented  in  Figure  3-  The  corresponding  shear  strains  at  the 
outer  fiber  are  0.1,  0.3,  0.6,  and  1.0  percent,  respectively.  Notice  that  the 
outer  fiber  is  the  first  one  to  yield  at  ya  -  0.1  percent;  subsequently  as 
more  torque  is  applied,  the  radius  of  the  elastic  inner  core  gets  smaller. 

Also  notice  that  the  rate  of  hardening  for  each  fiber  decreases  which  is,  of 
course,  in  accordance  with  strain  hardening  phenomena. 

Figure  4  presents  the  distribution  of  the  shear  stress  in  the  bar  at 
three  stages  of  the  first  unloading  half-cycle-  The  top  and  bottom  curves 
correspond  to  the  beginning  and  end  of  the  unloading  stages,  i.e.,  ya  ■  ±  1 
percent.  The  corresponding  values  of  torque  are  1256,  -1387  lb-in- , 
respectively.  The  middle  curve  represents  the  residual  stress  distribution 
when  the  applied  torque  is  equal  to  zero.  A  few  iterations  are  needed  to 
reach  this  state  and  the  residual  shear  strain  at  the  outermost  fiber  is 
0.378  percent.  Figure  5  presents  the  distribution  of  the  shear  stress  in  the 
bar  at  three  stages  of  the  first  reloading  half-cycle,  i.e.,  Ts  =  -1387,  0, 
1461  lb-in.  The  middle  curve  represents  the  distribution  of  residual  stress 
when  the  torque  is  equal  to  zero.  Reverse  yielding  occurs  within  the  outer  12 
percent  of  the  section  when  Tg  ■  0  during  unloading  and  reloading.  The 
distribution  of  residual  stress  during  unloading  is  quite  different  from  that 
during  reloading  as  shown  in  Figures  4  and  5  for  the  first  cycle.  Similar 
results  for  the  residual  stress  distribution  during  the  second  unloading¬ 
reloading  cycle  are  shown  in  Figure  6,  The  solid  curve  is  the  favorable  one 
if  the  applied  torque  during  service  is  in  the  same  direction  as  the  initial 
loading. 
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ABSTRACT .  The  loading  and  unloading  problems  in  thick  tubes  subjected  to 
uniform  internal  pressure  have  been  analyzed  with  the  ADINA  finite  element 
code*  The  elastic-plastic  materials  are  modeled  by  two  strain-hardening 
rules  -  isotropic  and  kinematic.  The  von  Mises  yield  condition,  the 
associated  flow  theory,  and  a  multi-linear  stress-strain  curve  are  used  in 
both  material  models.  The  numerical  results  of  the  stresses  and  displacements 
for  thick  tubes  with  different  wall  ratios  are  obtained  as  functions  of 
loading  history.  A  comparison  of  numerical  results  based  on  two  material 
models  is  made, 

I ,  INTRODU CTI ON .  The  problem  of  pressurized  thick-walled  tubes  is  of 
practical  importance  to  pressure  vessels  and  the  autofrettage  process  of  gun 
barrels.  Many  solutions  for  this  problem  have  been  reported  over  the .last 
three  decades  [1-7],  This  is  a  result  of  different  mathematical  methods,  end 
conditions,  and  material  models.  Different  assumptions  for  Che  material 
properties  such  as  compressibility,  yield;  criterion,  flow  rule,  hardening 
rule,  etc.  can  lead  to  many  material  models,  A  common  feature  in  all  earlier 
Investigations  is  to  Introduce  certain  restrictive  assumptions  so  as  to 
simplify  the  mathematical  analysis  [1-4].  The  recent  development  in  numerical 
methods  makes  It  possible  to  use  more  realistic  material  model  and  to  consider 
more  general  problems.  Both  the  finite  element  method  [5]  and  the  finite 
difference  method  [6,7]  have  been  used  to  solve  the  elasto-plastic  problems 
with  different  end  conditions  and  more  general  loading  conditions.  The 
material  model  was  based  on  the  von  Mises  yield  criterion,  the  Prandtl-Reuss 
flow  theory,  and  the  isotropic  hardening  rule. 

The  finite  element  method  is  more  powerful  and  can  be  used  to  solve  more 
general  nonlinear  problems  [8,9].  Many  finite  element  codes  have  been 
developed  as  seen  In  a  recent  survey  paper  [10].  The  ADINA  code,  developed  by 
K.  J,  Bathe,  is  a  general  purpose  finite  element  program  for  Automatic  Dynamic 
Incremental  Nonlinear  Analysis  [11].  The  standard  version  models  the  elastic- 
plastic  behavior  of  metals  by  the  use  of  the  Mises  yield  criterion,  the 
associated  flow  theory,  and  two  strain-hardening  rules  -  isotropic  and 
kinematic.  Both  hardening  models  were  limited  to  linear  hardening  in  our 
first  version  acquired  in  1981.  The  multi-linear  option  was  allowed  in  our 
second  version  one  year  later.  This  paper  shows  an  application  of  the  ADINA 
code  to  our  pressurized  thick  tube  problems.  A  multi-linear  stress-strain 
curve  is  used  in  both  material  models  and  thick  tubes  of  different  wall  ratios 
are  considered.  The  numerical  results  together  with  a  brief  summary  of  the 
elastic-plastic  theory,  finite  element  formulation  are  presented  below  with 
emphasis  on  the  basic  assumptions  used.  More  detailed  theoretical  Information 
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can  be  found  in  a  forthcoming  report  [12]. 

IX.  ELASTIC -PLASTIC  THEORY.  In  elastic-plastic  analysis  the  material 
behavior  is  described  using  three  properties  in  addition  to  the  elastic 
stress-strain  relations,  namely  a  yield  criterion,  a  flow  rule,  and  a 
hardening  rule. 

The  Initial  and  subsequent  yield  condition  for  isothermal  kinematic  or 
isotropic  hardening  can  be  written  as 

f(<?ij-aij)  “  o(/deP)  =*  0  (1) 

where  is  the  stress  tensor,  <*ij  is  a  tensor  denoting  the  translation  of 
the  yield  surface,  f  is  the  yield  function,  and  a(/deP)  represents  the 
dependence  of  the  yield  stress  on  the  accumulated  increments  of  effective 
plastic-strain.  The  von  Mises  yield  function  for  kinematic  hardening  is 

3 

f  -  [-  Csij-OijKsij-aij)]1/2  (2) 

where 

1 

sij  “  <*ij  "  -  °kk^ij 

and 

ay  =  0  for  isotropic  hardening  (3) 

Restricting  the  analysis  to  associated  flow  rules,  the  plastic  strain 
increment  de^jP  is  derivable  from  the  plastic  potential  function  f  by 

deijP  =  qijdX  and  -  9f/3a^j  (4) 

where  dX  is  a  scalar  to  be  determined. 

During  active  plastic  deformation  the  yield  function  must  be  satisfied 
continuously,  so  that  the  consistency  condition  is 

(dCij-da^j  )  3f/  SOjLj  =0  (5) 

The  original  kinematic  hardening  concept  was  Prager's  rule  [13]  that 

2 

dct^j  -  (-  H')deijP  and  H'  =  da/deP  (6) 

Prager's  rule  was  used  in  the  ADINA.  formulation  although  its  modification  by 
Ziegler  [14]  is  more  popular.  Equations  (1)  through  (6)  are  the  basic 
equations  of  the  elastic-plastic  theory.  In  addition,  we  need  the  elastic 
stress-strain  relation 


daij  =  Fijian  (dSran 


(7a) 


where  is  the  elastic  constitutive  tensor.  If  the  material  is  initially 

isotropic,  then 

Eijmn  =  t$im$jn  +  ^ij  5mn ^  (7b) 


where  E  and  v  are  the  Young’s  modulus  and  Poisson’s  ratio,  respectively. 

Using  the  basic  equations  (1)  to  (7),  we  can  obtain  the  incremental 
stress-strain  relation  for  elastic-plastic  material  models 


dOij  =  Dijmn 

(8a) 

where 

Eij  tu  3tu  <lvw  Evwmn 

Dijmn  *  Eljran  “  TT1 - * - 

H  +  ^kl  Eklrs  ‘Irs 

(8b) 

This  constitutive  relation  holds  for  the  combined  isotropic-kinematic 
hardening  model.  For  the  special  cases  using  Eqs.  (2)  to  (4),  we  have 

isotropic  hardening:  qij  =  3sij/(2a) 

(9a) 

kinematic  hardening:  q^j  =  3(sij-u^j )/(2o) 

(9b) 

III. 

FINITE  ELEMENT  FORMULATION.  The  finite  element  formulation 

used  in 

ADINA  is  very  general  that  large  strain  dynamic  analysis  has  been  considered 
[11,12].  Since  the  present  problem  requires  only  a  small  strain  static 
analysis,  a  very  brief  summary  of  the  special  formulation  is  presented  here. 
The  geometry  of  the  body  is  discretized  by  two-dimensional  8-nodes 
isoparametric  elements.  The  coordinates  and  displacements  are  Interpolated  by 
the  same  shape  functions  i  =  1  to  8,  l.e., 

x  =  %xi  ,  u  =  N^ui  ,  etc. 

(10) 

where  x^,  y^,  u^,  v^  are  the  coordinates  and  displacements  at  the  nodal 
points.  The  strain  increments  in  elements  can  be  obtained  by  differentiation 
and  in  matrix  notation  we  have 

{Ac}  =  [B]{AU>  and  [B]  -  [L)lN]  (11) 

where  [L]  is  a  linear  differential  operator  and  {AU}  is  a  vector  of  all  nodal 
displacement  increments  in  an  element. 

Once  we  know  [D]  and  [B] ,  we  can  compute  the  element  stiffness  matrix  by 

[K]  =  /  [B]t[D] IB]  d(vol)  (12) 

V 

To  carry  out  numerical  integration,  we  express  all  matrices  and  volume  element 
in  terras  of  local  coordinates  and  evaluate  them  at  integration  stations  with 
the  aid  of  Gauss  quadrature  formulae.  For  double  summation  we  use  either 
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(2x2)  or  (3x3)  points  in  a  rectangle.  This  finite  element  formulation  is 
based  on  displacements  so  the  kinematic  equations  and  constitutive  equations 
are  satisfied  locally.  The  principle  of  virtual  displacements  is  used  to 
express  the  equilibrium  of  the  body  in  the  current  configuration.  Since  the 
principle  is  in  integral  form,  we  can  sura  all  element  contribution  to  the 
system, 

IV.  THICK  TUBES.  Consider  a  long  thick  tube,  internal  radius  a,  and 
external  radius  b,  which  is  subjected  to  internal  pressure  p.  Thick  tubes  of 
different  wall  ratios  are  considered.  The  geometry  of  the  tube  Is  discretized 
by  two-dimensional  axiaymmetric  8-nodes  isoparametric  elements  along  the 
radial  direction.  We  use  10  elements  for  smaller  wall  ratios  (b/a  =  1.5  and 
2.0)  and  20  elements  for  larger  wall  ratios  (b/a  *  3,0  and  4,63).  All 
elements  are  of  equal  size  and  3x3  points  are  used  in  carrying  out  the 
numerical  integration.  The  displacements  at  the  nodal  points  and  the  stresses 
at  the  integration  points  are  obtained  as  functions  of  loading  history.  At 
each  stage  of  loading,  we  have  N+l  results  for  the  displacements  and  3N 
results  for  the  stresses  where  N  is  the  number  of  elements  used. 

The  common  input  data  for  both  material  models  are  E  =  2.583xl07  psi,  v  = 
0.3,  and  6  points  on  the  uniaxial  stress-strain  curve,  i.e.,  (a  in  Ksi,  £  in 
%)  =  (155,  0.6),  (167,  0.85),  (172,  1.25),  (177,  3.0),  (181,  5),  (181,  15), 
These  six  points  are  chosen  to  give  a  piecewise  linear  representation  to  the 
actual  stress-strain  curve  for  a  high  strength  steel  as  shown  in  Figure  1. 

The  ADINA  code  allows  a  maximum  of  7  points  to  represent  two  multi-linear 
hardening  models  (model  number  8  and  9  for  isotropic  and  kinematic  hardening) . 
These  two  hardening  models  are  Widely  used  because  of  their  simplicity. 
Isotropic  hardening  is  generally  considered  to  be  a  suitable  model  for  large 
plastic  flows.  Kinematic  hardening  is  the  simplest  theory  that  can  model  the 
Bauschinger  effect.  If  unloading  does  not  occur,  there  is  no  difference 
between  these  two  models.  For  unloading  with  reverse  yielding,  the  finite 
element  results  based  on  these  two  models  will  be  different. 

The  loading  and  unloading  problems  in  thick  tubes  of  different  wall 
ratios  have  been  analyzed  using  the  ADINA  code  and  two  hardening  models.  The 
tubes  of  wall  ratios  1.5  and  2.0  have  been  loaded  to  reach  fully  plastic  state 
and  then  unloaded  completely.  No  reverse  yielding  occurs  during  unloading  for 
tubes  with  both  wall  ratios  and  the  usual  assumption  of  elastic  unloading  is 
justified  on  the  basis  of  these  two  material  models.  The  numerical  results 
for  the  tube  with  b/a  -  2  are  shown  in  Figures  2  through  4.  Figure  2  shows 
the  boundary  displacements  (Ua  and  U^)  as  functions  of  pressure  history.  We 
use  11  steps  during  loading  and  2  steps  during  unloading.  Figure  3  shows  the 
hoop  stress  distribution  at  different  t  steps  (t  -  1,  6,  11,  13)  where  t  is  a 
time-like  parameter  for  the  purpose  of  bookkeeping.  Figure  4  shows  the 
distributions  of  residual  radial  and  axial  stresses  and  equivalent  plastic 
strain.  The  residual  stresses  are  considered  to  be  elastic  according  to  these 
two  models.  The  unloading  process  may  not  be  purely  elastic  if  other  models 
[4]  are  used.  Future  work  should  search  for  a  more  realistic  model  including 
the  Bauschinger  effect  in  a  high  strength  steel  [15].  Experimental 
measurements,  if  available,  should  be  used  for  comparison  with  numerical 
predictions. 
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The  tube  of  wall  ratio  3(a  =  1" ,  b  31  3")  has  also  been  loaded  to  reach 
fully  plastic  state  and  then  unloaded  completely.  We  use  11  steps  during 
loading  and  4  steps  during  unloading.  Figure  5  shows  the  boundary 
displacements  (Ua  and  U^)  as  functions  of  pressure  history.  The  numerical 
results  for  the  displacements  during  unloading  are  very  close  between  the  two 
models#  However,  there  are  noticeable  differences  in  the  size  of  reverse 
yielding  and  the  stresses  within  a  small  zone  near  the  bore.  There  are  60 
stations  along  the  radial  direction  at  which  the  stresses  are  calculated.  At 
the  end  of  complete  unloading,  reverse  yielding  occurs  at  3  or  7  stations  near 
the  bore  according  to  isotropic  or  kinematic  models,  respectively.  Figure  6 
shows  the  stresses  at  a  point  near  the  bore  as  functions  of  pressure  history. 
The  differences  between  the  two  models  for  the  hoop  and  axial  stresses  during 
unloading  are  not  small  as  can  be  seen  in  the  figure. 

Finally,  the  autofrettage  solution  for  a  closed  volume  chemical  "bomb”, 
is  obtained  for  a  tube  with  a  =  0.865"  and  b  =*  4.005".  The  tube  is  loaded  to 
p  a  250  Ksi  in  10  steps  and  then  unloaded  completely  in  5  steps.  At  maximum 
pressure,  26  of  its  60  stations  have  become  plastic.  At  the  end  of  complete 
unloading,  reverse  yielding  occurs  at  2  or  5  stations  near  the  bore  according 
to  isotropic  or  kinematic  models,  respectively.  Figure  7  shows  the  boundary 
displacements  (Ua  and  U^)  as  functions  of  pressure  history.  There  are  small 
differences  for  the  displacements  during  unloading  based  on  two  models.  The 
results  for  the  stresses  within  the  inner  half  of  the  tube  are  presented  in 
Figures  8  and  9.  Figure  8  shows  the  hoop  stress  at  different  stages  of 
loading  and  unloading.  Three  stages  (t  =  1,  10,  15)  represent  the  stage 
corresponding  to  initial  yielding,  maximum  loading,  and  complete  unloading, 
respectively.  The  differences  for  the  hoop  stresses  during  unloading  based  on 
two  hardening  models,  are  not  small  as  can  be  seen  in  this  figure.  Figure  9 
shows  the  differences  for  the  axial  and  radial  stresses  within  the  inner  half 
of  the  tube  after  complete  unloading. 
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Figure  1.  Stress-Strain  Curve  for  a  High  Strength  Steel 


ons  of  Pressure  History 


r 


Figure  A.  Distribution  of  Radial  and  Axial  Stresses 

and  Equivalent  Plastic  Strain  After  Complete 
Unloading. 
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Pressure  History, 


5  =(r-co/(b-a) 


Figure  8.  Distribution  of  Hoop  Stress  in  the  Inner  Half 
at  Different  Stages  of  Loading. 
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A  FINITE  INTEGRAL  TRANSFORM  (FIT)  METHOD  FOR  DIFFERENTIAL 
EQUATIONS  HAVING  ASYMPTOTIC  SOLUTIONS 


Charles  J.  Daly 
Earth  Sciences  Branch 
Research  Division 

Cold  Regions  Research  and  Engineering  Laboratory 
Hanover,  New  Hampshire  03755 


ABSTRACT.  An  analytical  method  is  developed  for  differential  equa¬ 
tions  whose  solutions  y(t)  decay  asymptotically  for  large  t.  The  approach, 
called  the  finite  integral  transform  (FIT)  method.  Is  based  upon  the  use  of 
selected  transform-inverse  pairs  for  which  the  inverses  are  known  explicit¬ 
ly.  Using  the  method,  approximate  analytical  expressions  for  y(t)  are 
obtained  in  the  form  of  series  of  orthogonal  functions  tn(t).  Asymptotic 
behavior  of  the  approximating  series  is  guaranteed  by  requiring  each 
to  be  asymptotic  for  large  t.  The  central  feature  of  the  FIT  approach  is 
the  generation  of  a  system  of  algebraic  equations  which  is  solved  for  the 
vector  of  coefficients  appearing  in  the  orthogonal  series  solution.  The 
FIT  method  amounts  to  a  discretization  of  the  unknown  solution  in  a  fre¬ 
quency  domain.  This  is  in  contrast  to  finite  difference  and  finite  element 
methods  which  discretize  over  the  domain  of  independent  variables.  The  FIT 
approach  is  demonstrated  by  application  to  a  simple  linear  ODE  with  vari¬ 
able  coefficient. 


BACKGROUND.  The  theory  of  finite  integral  transforms  is  well  known, 
having  its  genesis  in  Fourier’s  analyses  of  the  differential  equations  of 
heat  flux.  As  Sneddon  (1972)  points  out,  finite  transforms  arise  from  the 
solutions  of  well-posed  differential  equations  of  the  Sturm-Liouville 
variety.  Since  the  different  eigenfunctions  corresponding  to  a  particular 
Sturm-Liouville  problem  are  mutually  orthogonal,  simple,  yet  extremely 
useful  finite  integral  transform- inverse  pairs  can  be  formulated.  For 
example,  the  problem: 


♦(0)  -  ♦(*)  -  0 


(1) 


gives  rise  to  the  familiar  finite  sine  transform-inverse  pair: 

G(n)  =^(g(  x);  x+n)  -  /  g(x)  sin  nx  dx  (2) 

0 

2  " 

g(x)  =  -r  l  G(n)  sin  nx  .  (3) 

n-1 

Other  examples  are  the  finite  cosine,  Hankel,  Legendre,  Tchebycheff, 
Mellin,  and  Laguerre  transforms  cited  by  Sneddon  (1972). 
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Finite  integral  transforms,  in  general,  have  a  remarkable  and  extreme¬ 
ly  useful  property.  Consider  f(x)  to  be  a  piecewise  continuous  function 
for  which  the  integral  of  a  finite  transform  exists  for  all  n.  The  enumer¬ 
able  set  of  values  F(n)  (the  transform  amplitudes)  amounts  to  a  complete 
description  of  f(x)  over  the  interval  of  definition  of  the  transform, 
excepting  discontinuities.  This  observation  is  the  basis  of  the  FIT  method 
for  solving  differential  equations  that  are  not  amenable  to  the  techniques 
of  classical  analysis.  For  example,  given  that  the  solution  to  a  differen¬ 
tial  equation  g(x)  satisfies  the  boundary  condition  of  (1),  then  there  must 
be  a  finite  sine  representation  of  g  of  the  form  of  (3);  the  problem 
becomes  one  of  determining  the  values  G(n).  Using  this  approach,  Daly  and 
Morel-Seytoux  (1981)  were  able  to  solve  linear  elliptic,  and  time  invariant 
parabolic  PDErs  in  terms  of  finite  sine,  cosine,  and  modified  Laguerre 
transform  series, 

ANALYSIS.  The  Laguerre  polynomials: 


n!  dtn 


(4) 


are  the  basis  of  a  transform-inverse  pair  described  by  McCully  (1960)  and 
Hirschman  (1963): 


A(n)  =G^(a(t);  t+n)  ®  /  e  C  e$_(t)  a(t)  dt  (5) 


a(t)  ■=  l  A(n)  «£n(t)  t  >  0  (6) 

n=0 

From  a  practical  standpoint,  the  usefulness  of  a  transform-inverse 
pair  depends  on  the  ability  to  approximate  the  inverse,  for  all  t,  by  a 
finite  number  of  terms.  Considering  (6)  it  is  easy  to  demonstrate  that  any 
partial  sum  of  two  or  more  terms  diverges  as  t  becomes  large.  A  modified 
Laguerre  transform  which  is  more  appropriate  for  representing  asymptotic 
functions  is: 


Y(n)  =l/(y(t)i  t+n)  =  /  e  t//2  <$n(t)  y(t)  dt 

gp 

y(t)  =  e-t/2  l  Y(n)  S&  (t)  t  >  0 
n=0  n 

To  illustrate  the  FIT  method  consider  the  simple  linear  ODE: 
4?  +  a(t)  y  =  b(t)  y(0)  =  l 


(7) 

(8) 

(9) 
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such  that  y  decays  asympotically  as  t  becomes  large.  Assume  that  a(t)  is 
given  by  M+l  terms  of  (6)  so  that  (9)  becomes: 


|+y  I  A(m)o$m(t)  =  b(t)  y(0)  =  C  (10) 


Taking  the  modified  Laguerre  transform  of  (10)  according  to  (7)  gives: 

n-1 

-25  +  Y(n)  +2  l  Y(p) 

p=0 


+  2  5]  A(m)  /  e  <$n(t)  <^m(t)  y(t)  dt 

m“0  o 


=  2  /  e_t/2Z  (t)  b(t)  dt  i  2  B(n)  (11) 

U  r  n 


where  the  transform  of  the  derivative  is  obtained  by  partial  integration 
(Daly,  1979). 

The  product  of  two  Laguerre  polynomials  can  be  written  as  a  sum  of 
Laguerre  polynomials: 


Q  0  n+m 

4nct>  -  iq 


The  coefficients  R  are  derived 
(ll)  gives: 


n-1  M 

Y(n)  +  2  l  Y(p)  +2  l  A(m) 
p=0  mO 


R(k;  n,m)  <4k(t)  (12) 


in  the  Appendix.  Substitution  of  (12)  in 


n+m 

I  R(k;n,m)Y(k)  =  2(5  +  B(n))  (13) 

k=0 


For  n  =»  0,1,2,.. .N,  (13)  amounts  to  a  set  of  linear  algebraic  equa¬ 
tions  for  the  N+l  values  of  Y(n).  Assuming  that  y(t)  is  adequately  repre¬ 
sented  by  those  N+l  coefficients: 


y(t)  »  e  Y(n)  Jp  (t)  *  y  (t) 

n=0  n  c 


(14) 


A  Gauss-Seldel  iterative  procedure  can  be  used  to  solve  (13)  for  the  Y(n). 
As  an  example,  consider  the  simple  problem: 


||  +  2ty  -  (4t-l)e’t/2  y(0)  =  10 


(15) 
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which  can  be  solved  for  the  exact  solution: 


y.<o 


+  2e 


-t/2 


(16) 


Using  the  FIT  approach,  where  it  is  assumed  that  y(t)  can  be  adequate 
ly  represented  by  N+l  =  21  terms,  (15)  is  solved  for  the  spectral  ampli¬ 
tudes  Y(n)  shown  in  Figure  1-  Amplitudes  are  calculated  to  five  digit 
accuracy  by  the  Gauss-Seidel  procedure* 

Figure  2  is  a  comparison  of  ye(t)  and  yc(t)  given  by  (16)  and  (14) 
respectively-  The  FIT  solution  is  an  excellent  fit  to  the  closed  form 
solution* 


CONCLUSIONS.  The  finite  integral  transform  method  has  been  demon¬ 
strated  for  the  solution  of  a  linear  ODE  with  variable  coefficient*  The 
apparent  success  of  the  procedure  is  an  indication  of  the  potential  of  FIT 
methods  for  solving  more  complex  problems*  Advantages  of  the  approach 
include  the  generation  of  analytic  expressions  and  the  lack  of  node  or  grid 
systems  as  required  by  many  alternative  numerical  methods* 


/ 


554 


APPENDIX*  Because  the  Laguerre  polynomials  are  orthogonal  on  (0,“) 
with  respect  to  (12)  becomes 


R(k;n*m)  =  /  e 
0 


k(t)  4.n(t)  *B(t>  dt 


Expanding  the  product  of  Laguerre  polynomials  gives: 

os  ? 


p-0 

n-Hn  p 
p=0  i =0 

i<n 

i_>p-m 


(-Dp 


_ r  'i  r  m  itp 

it(p-i)!  lp-iJC 


:(p-D 


(Al) 


(A2) 


Let: 


c(p) 


P  (-DP 

ito  iKp-i)i 

i<n 

i>p— m 


(A3) 


Then: 


R(k;n,m) 


n+m  “  o 

l  c(p)  /  e  (t)  tPdt 

p=0  0 


Evaluating  the  integral  of  (A4)  and  substituting  (A3)  gives: 

n+m  -i 

R(kjn,m)  =  l  c(p)  (-1)  (?]p! 

n+m  1  p 

=  I  (-n^  (?)  l  (?)  LZi)  (?) 

p=k  K  i=o  1  p  1  1 

i<n 
Op-m 


(A4) 


(A5) 
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R.  YALAMANCHILI 


TECHNOLOGY  BRANCH,  ARMAMENT  DIVISION 
FIRE  CONTROL  &  SMALL  CALIBER  WEAPON  SYSTEMS_LABORATORY 
U.S.  ARMY  ARMAMENT  RESEARCH  AND  DEVELOPMENT  CENTER,  DOVER,  NJ  07801 


ABSTRACT 


The  most  general  heat  diffusion  equation  possesses  not  only  first- 
and  second-order  derivatives  in  space  but  also  first-and  second-order 
derivatives  in  time*  Therefore,  the  governing  equation  can  be  parabolic, 
elliptic  or  even  hyperbolic  depending  upon  the  parameters  chosen*  The 
model  includes  various  physical  problems,  such  as,  steady  and  unsteady 
classical  heat  conduction  (usually  known  as  classical  Fourier  model) , 
heat  pulse  (Non-Fourier  model),  abrasive  cut-off  and  surface  grinding 
operations  in  machining  of  metal  components.  An  explicit  and  uncondi¬ 
tionally  stable  finite  difference  scheme  is  developed  for  the  general 
purpose  governing  equation.  The  heat  transfer  example  Is  included  to 
discuss  the  accuracy  and  stability  of  this  numerical  scheme. 


I.  INTRODUCTION. 

The  mathematical  model,  considered  here,  represents  various  physical 
problems  in  the  area  of  heat  transfer*  These  include  numerous  problems  in 
steady  and  unsteady  heat  conduction;  large  heat  flux  applications,  such  as, 
plasma  torch  and  hypervelocity  weapon  systems;  and  problems  arising  from 
manufacturing  operations,  such  as,  abrasive  cut-off  and  surface  grinding. 
Although  all  these  physical  problems  look  entirely  different,  one  may  be 
able  to  generalize  them  into  a  single  model  and  prepare  a  single  numerical 
method  for  its  solution.  It  is  important  to  realize  that  the  generalized 
model  can  be  specialized  to  parabolic,  elliptic  or  even  hyperbolic  depending 
upon  the  problem  of  Interest. 
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Currently  the  finite-difference  methods  are  common  for  the  solution  of 
partial  differential  equations  in  both  design  and  research  and  development 
because  of  the  advent  of  super  and  inexpensive  computers.  The  numerical 
methods  for  the  unsteady  heat  diffusion  equation  receive  most  attention 
not  only  because  of  heat  transfer  applications  but  also  because  they  are 
the  basis  for  any  study  of  parabolic  partial  differential  equations.  It 
is  common  to  test  the  heat  conduction  example  first  even  though  the  methods 
are  developed  for  complex  nonlinear  parabolic  partial  differential  equations 
which  may  arise  in  other  fields.  Yalamanchili  [1,  2]  showed  that  the  finite- 
element  and  finite-difference  methods  belong  to  the  class  of  method  of  weight¬ 
ed  residuals.  It  is  also  concluded  that  the  finite-element  method  is  more 
conservative  in  both  stability  and  nonoscillation  characteristics  than  the 
finite-difference  method,  but  not  as  conservative  as  the  method  of  weighted 
residuals.  Since  the  finite-element  method  is  unique  because  of  Gurtin's 
[3]  variational  principle  and  numerous  finite-differences  can  be  constructed 
with  ease,  it  is  found  that  some  finite-difference  schemes  are  better  than 
the  finite-element  scheme  in  accuracy  also.  Therefore,  further  attention  is 
focussed  here  on  finite-difference  schemes  only.  An  example  is  considered 
to  show  where  the  present  solution  method  stands  in  comparison  to  similar 
numerical  solution  methods. 


II.  GENERALIZED  HEAT  CONDUCTION  MODEL, 


The  generalized  model  including  charring  ablation  [4]  may  be  stated  as 


3T 

at 

where  T 
t 
u 

v 

r 

z 

3 


ct 

a 


3T  31  1  32T 

3t  “  Vat‘  I2-  It2" 

=  Temperature 
=  time 


l  a  ,  j  aT  .  .  a  ,  st 

- (arJ  -  )  +  -  (a  - 


3r 


=  speed  of  tool  or  work  piece  in  the  radial  direction 
=  speed  of  tool  or  work  piece  in  the  axial  direction 
=  radial  coordinate 
=  axial  coordinate 
*  0,  plane 

=  1,  axisymmetric  body 


=  thermal  diffusivity 
*  speed  of  sound 


(1) 
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The  following  dimensionless  variables  are  introduced  for  dependent 
and  independent  variables: 


0  =  aR/a  ,  y  *  az/a  ,  <5  -  a2t/a  ,  and  9  =  (T  -  Tq)/(Tw  -  T  )  (2) 

where  R  -  r  for  a  plane 

=  inr  for  axisymmetric  case 
T0  =■  Initial  temperature 

Tw  =  wall  temperature 

The  resulting  governing  equation  in  terms  of  dimensionless  variables 
is  shown  as 


P 


D 


Cuu 

C  90 


r  S® 
G  96 


(3) 


P  equal  to  unity  represents  a  heat  pulse  (large  heat  flux)  problem. 
Otherwise,  it  is  zero.  Q  equals  unity  for  any  transient  case.  C  is  one 
for  abrasive  cut-off,  whereas  G  is  unity  for  surface  grinding.  Otherwise, 
C  and  G  will  be  zero.  The  value  of  D  will  depend  upon  whether  plane  or 
axisymmetric,  as  defined  before.  H  will  be  zero  for  one-dimensional  prob¬ 
lems.  Otherwise,  it  is  one. 


III.  NUMERICAL  METHOD. 

The  dimensionless  governing  equation  is  not  only  general  from  the 
physical  point  of  view,  but  also  mathematically.  This  can  be  reduced  to 
parabolic,  elliptic,  or  hyperbolic  by  appropriate  selection  of  parameters 
P,  Q,  D,  C,  H,  and  G.  It  is  quite  common  to  use  an  entirely  different 
method  depending  upon  its  mathematical  characteristics.  However,  a  simple 
explicit  finite-difference  formulation  is  utilized  to  obtain  the  solution 
numerically. 

The  following  difference  notation  is  utilized: 


Let 


0  =  0(0, y, 5)  -  0 

1 » J  * 

A6^  =  new  time-step 
A6q  =  previous  time-step 

f (6)  =  a  +  b6  +  c62  , 


k 


(4) 
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Choose  5  equal  to  zero  at  a  time  when  the  old  time  step  A6q  has 
been  changed  to  new,  A5  ,  Let  the  corresponding  time  subscripts  be 
k-1,  and  k+1.  The  coefficients  of  the  quadratic  equation  become 


-  f. 


b  = 


£k+l  "  fk 


AS 


n 


f.  nA5  +  f.  ...AS  -  (AS  +  AS  )f. 

k-1  n  k+1  o _ n _ o  k 

A6  (A6  +  A5  ) 

n  o  o 


fk-lA6n  +  fk+lA5o  -  <AAo  +  ASn)fk 

AS  AS  (AS  +  AS  ) 
non  o 


The  derivatives  at  time,  k,  can  be  written  as 


(5) 


3f 

36 


b 


f  -  f 

3f  =  k+1  k 

36  A6 

n 


if  forward  differences  are  utilized. 


(6) 


In  addition  to  these  time  derivatives*  if  central  differences  are 
utilized  for  second  order  spatial  derivatives,  the  governing  equation  in 
difference  format  may  be  written  as: 


2P 


EA6  (A6  +  A6  )  +  AS^i^k+l  =  ^A6  A6  +  A5  "  { +  A3  +  Id?  *  Ayj 

n  n  o  n  non 


2P 


+  _a__  (2D  +£.+  “+£.)]. 


'0,  , 


2P 


0-.  +  (t|t  +  7T)0, 


i.j.k  AS  (AS  +  AS  )  i,j,k-l  SA8Z  Ag'  i+l,j,k 
o  n  o 


(7) 


+  #9i-i,j>k+  *  +  +  ’ 


IV.  STABILITY  ANALYSIS. 

The  difference  equation,  mentioned  above,  is  not  unconditionally  stable. 
According  to  Dusinberre  [5],  based  on  laws  of  thermodynamics,  this  scheme 
is  stable  only  if  the  following  condition  is  satisfied: 


2P  _9  >  2P  c  f  2H  1  G 

A 6  A  <5  A6  A32"  A3  A y1  Ay 

no  n 


(8) 
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However,  substitution  of  the  following  relationship  into  some  of  the 
terms  of  the  above  difference  equation  yiels  an  unconditionally  stable 
difference  equation. 


9i, j 


A5  0  t  +  A6  0.  .  iii 
n  x,  j  ,k-l  o  Tk+1 

A  (5  +  AS 

o  n 


(9) 


This  relatio  implies  a  linear  variation  of  temperature  between  times 
subscripts  (k-1)  and  (k+1) .  the  resulting  unconditionally  stsble  fifference 
equation  is  given  below: 


A0 .  .  _ 


2p  Q  DC 

(aTm;  +  -  B0i,j,k-1  +  (IF  +  Zs)0i+i(J,k 


+  Q  +  (  -ft.,  +  — )0  4“  — 0 

A67  i-lf j jk  VA6Z  A6J  i,j+l,k  A67  i,j-l,k 


(10) 


Where  A  = 


2P 


+  -2-  +  +  IL_  2H  «v 

T  A  £  ^  V  A  OZ  ~  A  D  “  A  Jf  2  T  AX1' 


AS 


AS  (AS  +  AS  )  AS  VA$^  Ag  AS2^  Afi'  AS  +  AS 
n  n  o'  n  no 


(11) 


and  B 


2P 


,  ,2D  C  H  G  . 

+  +  Aft  +  a*  2  +  A*) 


AS. 


AS  (AS  +  AS  )  vAfF  Ag  AS^  AS-'  AS  +  AS 
o  n  o  no 


(12) 


The  truncation  error  of  the  difference  equation  did  not  increase  due  to 
the  assumption  of  linear  variation  of  temperature  within  any  two  consecutive 
time-steps.  Surprisingly,  the  truncation  error  is  reduced  somewhat  due  to 
the  cancellation  of  some  of  the  terms.  The  subitution  of  an  assumption  also 
elimates  one  deficiency  exhibited  by  the  Dufort  and  Frankel  method  [6]  for 
nonlinear  problems  in  that  large  variations  in  1  otr  can  be  tolerated  without 
introducing  significant  errors. 

The  unconditional  stability  can  be  shown  by  assuming  an  error,  E,  as  a 
linear  combination  of  terms  of  the  form 


k  A  iftAg  v^T  j<f>Ay 
si,j,k  '  *  *  e 


(13) 


and  substituting  into  the  final  difference  equation,  one  can  obtain  the 
relation  for  damping  ratio  (n)  as 


n2  ■  r1  + 1  3  0  • 


(14) 
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Where  E 


2P  „  Q  .  ,  D  ,  C  ,  v^IftAB  .  D  -/^ftAB  ' 

“  ITiT  +  AS  +  (aF  +  AS  )e  +IFe 

non 

+  A  +  +  H  s-^iHy  .  as) 

Ay*  Ay  Ay'41 


It  can  be  shown  that  the  magnitude  of  the  damping  ratio  (for  amplitude) , 
n  ,  will  be  less  than  or  equal  to  unity  for  any  chosen  parameters.  Therefore, 
the  final  difference  equation  will  remain  stable  for  any  constant  integration 
interval  or  when  the  new  interval  is  not  larger  than  the  old  step  plus  the 


critical  step  increment 


aA<5  / (AS' 
o 


+  Ay2). 


V.  PARABOLIC  EXAMPLE. 

An  exact  analytical  solution  [7]  exists  for  the  one-dimensional  problem 
with  zero  initial  temperature  everywhere  and  length,  L.  The  boundary  conditions 
are  a  step  change  to  a  temperature  of  unity  on  one  end  and  insulated  (zero  heat 
flux)  on  the  other  end.  Instead  of  approximating  the  heat  flux  boundary  con¬ 
dition  by  a  difference  equation,  one  can  also  consider  a  body  of  length  2L  and 
apply  a  step  change  in  temperature  to  unity  on  both  ends.  In  this  case,  there 
is  no  error  involved  in  consideration  of  zero  heat  fluxes  or  insulated  surfaces. 
The  typical  computer  time  is  18  seconds  for  40  spatial-steps  and  600  time-steps. 
The  results  obtained  by  the  present  method  as  well  as  by  several  other  methods 
are  shown  in  Tables  1  and  2. 


TABLE  1.  Comparison 

of  Errors  (xlO5) 

Method 

X/L=0 .2 

X/L=0.6 

X/L=0 . 8 

Analytical 

.080929 

.371439 

.654747 

Cr ank-N icho 1 son [ 8 ] 

121 

244 

175 

Saul ’yev[9] 

218 

252 

231 

Liu [10] 

351 

240 

134 

Dufort-Erankel 

90 

367 

296 

Present 

194 

96 

35 
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TABLE  2.  Comparison  of  Errors (XIO^) 


Method 

X/L=0 .2 

X/L-0.6 

X/L-0.8 

Analytical 

.647367 

.782053 

.885416 

Crank-Nicholson [ 8 ] 

4.1 

2.6 

1.4 

Saul *yev[9] 

'  120 

72 

33 

Liu [10] 

72 

45 

26 

Dufort-Frankel 

126 

79 

42 

Present 

27 

16 

9 

VI.  CONCLUSIONS , 

The  Fourier  number  is  0.1  for  the  results  in  Table  1.  The  present 
method  is  more  accurate  than  not  only  other  stable  explicit  schemes  but 
also  the  famous  implicit  scheme  at  dimensionless  positions  of  0.6  and  0.8, 
Similar  results  are  shown  in  Table  2  for  a  later  time  (Fourier  number)  of 
0,5.  The  present  method  is  much  better  than  any  other  stable  explicit 
scheme  for  all  positions.  Only,  the  Crank-Nicholson  implicit  scheme  ex¬ 
ceeded  the  accuracy  of  the  present  approach  for  larger  dimensionless  times 
(but  not  for  small  times).  However,  the  Crank-Nicholson  scheme  requires 
more  computer  time  because  of  implicit  procedures  than  the  present  method. 
Therefore,  the  generalized  model  and  the  numerical  method  are  a  great  asset 
for  scientists  and  engineers. 
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NUMERICAL  BOUNDARY  CONDITIONS  FOR  FLOW  PROBLEMS 
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ABSTRACT.  The  development,  analysis,  and  implementation  of 
numerical  boundary  conditions  for  flow  calculations  in  infinite 
domains  are  discussed.  Emphasis  is  placed  on  potential  flow, 
periodic  acoustic,  and  incompressible  viscous  flow  problems.  In 
all  cases,  the  infinite  domain  problems  are  approximated  by 
problems  posed  on  a  bounded  domain.  To  close  the  numerical 
problem,  an  artificial  numerical  boundary  condition  is  imposed. 

The  effect  of  these  approximate  boundary  conditions  on  the  accuracy 
of  the  numerical  computations  is  examined. 

I.  INTRODUCTION.  When  a  physical  problem  is  posed  on  an 
unbounded  domain,  its  solution  cannot  be  directly  approximated 
by  a  numerical  technique.  There  are  four  classes  of  methods 
employed  to  treat  such  problems.  The  first  is  to  map,  either 
numerically  or  analytically,  the  infinite  domain  into  a  finite 
one,  and  then  discretizing  the  governing  partial  differential 
equations  by  standard  techniques,  e.g.,  finite  difference  or 
finite  element  methods.  This  class  of  methods  has  often  proved 
to  be  quite  successful,  especially  for  potential  flow  problems. 
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However,  for  complicated  regions,  e.g. ,  the  exterior  to  an  airfoil, 
the  numerical  generation  of  the  mapping  function  may  be  rather 
costly.  Furthermore,  for  problems  with  oscillatory  solutions, 
e.g.,  periodic  acoustic  problems,  mapping  to  a  finite  domain  just 
trades  the  problem  of  the  infinite  domain  for  the  equally  impossible 
problem  of  resolving  very  rapidly  (in  fact,  infinitely  rapidly) 
oscillating  solutions  within  the  new  bounded  domain. 

A  second  popular  approach  for  exterior  problems  is  to  transform 
the  partial  differential  equation  problem  on  an  infinite  domain  into 
an  integral  equation  problem  posed  on  the  finite  boundary  of  the 
domain.  The  integral  equation  may  then  be  discretized  by  the  usual 
techniques.  This  type  of  approach  is  limited  to  problems  for  which 
a  free  space  Green's  function  is  known.  Furthermore,  discretization 
of  the  integral  equation  leads  to  dense  (but  finite)  matrix  problems. 

A  third  method  of  treating  infinite  domain  problems  is  related 
to  the  integral  equation  approach.  This  approach  couples  a  far 
field  solution  with  a  discretization  by  finite  differences  or 
finite  elements  in  a  bounded  region.  The  coupling  is  effected 
by  introducing  auxiliary  variables,  defined  on  the  artificial 
boundary  of  the  truncated  domain,  and  requiring  these  auxiliary 
variables  to  satisfy  an  integral  equation  along  that  boundary. 

This  integral  equation  carries  information  about  the  solution 
in  that  part  of  the  original  domain  lying  outside  the  truncated 
domain.  In  order  to  apply  this  method,  we  need  only  know  a  free 
space  Green* s  function  for  the  differential  operator  governing 
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the  solution  outside  of  the  truncated  domain.  One  way  to  view 
this  method  is  to  think  of  it  as  solving  the  partial  differential 
equation  problem  on  a  bounded  truncated  domain,  with  the  integral 
equation  serving  as  an  exact  non-local  boundary  condition.  By 
non-local  we  mean  that  the  (unknown)  solution  at  all  the  points 
on  the  artificial  boundary  are  coupled.  This  method  is  effective 
but  involves  the  introduction  of  additional  unknowns  and  often 
results  in  complications  in  the  resulting  linear  systems  which 
must  be  solved. 

The  fourth  method,  which  is  the  one  considered  in  this  paper, 
again  solves  the  partial  differential  equation  problem  on  a 
truncated  domain.  However,  we  now  impose,  on  the  artificial 
boundary,  an  approximate  local  boundary  condition.  By  local, 
we  mean  that  the  boundary  condition  holds  pointwise  at  the  boundary. 
Since  this  approach  involves  only  an  approximate  boundary  condition, 
it  is  desirable  to  develop  artificial  boundary  conditions  of  high 
accuracy.  Indeed,  the  method  can  be  cost  effective  only  when  the 
truncated  domain  is  relatively  small  in  its  extent.  However,  when 
the  method  works,  it  is  the  simplest  to  implement,  since  it 
essentially  consists  of  a  direct  discretization  of  the  partial 
differential  equations  on  a  bounded  domain  with  local  boundary 
conditions,  i.e.,  as  far  as  the  discretization  technique,  it  is 
essentially  one  for  an  interior  problem. 

In  Section  II,  we  discuss  potential  flow,  periodic  acoustic, 
and  incompressible  viscous  flow  problems  in  exterior  domains. 
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In  Section  III  we  discuss  incompressible  viscous  flow  problems 
in  channels.  We  also  refer  the  reader  to  [1]  which  was  presented 
at  a  previous  Army  conference,  which  treats  numerical  boundary 
conditions  for  unsteady  wave  propagation  problems* 


II.  FLOW  PROBLEMS  IN  EXTERIOR  DOMAINS*  In  this  section  we 

first  consider  problems  governed  by  second  order  elliptic  partial 

differential  equations  and  which  involve  exterior  domains,  i.e., 

2  3 

domains  which  are  the  exterior  of  a  bounded  body  in  R  or  R  . 
In  the  vicinity  of  the  body,  the  "near  field",  the  equation  may 
have  variable  coefficients.  However,  away  from  the  body,  the 
"far  field",  we  assume  that  the  equation  reduces  to  one  with 
constant  coefficients,  at  least  in  an  asymptotic  sense.  We 
introduce  an  artificial  boundary,  denoted  by  TR,  and  assume 

_  that  it  contains  the  inner 


boundary  which  is  denoted  by  T. 
The  original  infinite  domain  we 
denote  by  0,  and  the  truncated 
domain,  i.e.,  the  region  between 
P  and  rR,  by  C^.  We  assume 
that  the  artificial  boundary  TR 
is  placed  sufficiently  far  away 


from  the  inner  boundary  r  so  that  the  differential  equation,  in 


the  vicinity  of  TR,  is  already  in  its  far  field,  "constant" 
coefficient  form.  Also,  in  practice  one  would  naturally  pick  TR 
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to  be  geometrically  simple,  e.g.,  a  box,  circle,  or  sphere.  The 
method  essentially  assumes  that  the  given  differential  equation 
holds  in  the  region  that  the  given  boundary  conditions  hold 

on  r,  and  that  the  truncated  problem  is  closed  by  choosing  an 
artificial  boundary  condition  on  the  artificial  boundary  TR. 

The  particular  choice  of  this  artificial  boundary  condition  is 
crucial  to  the  design  of  an  effective  algorithm.  The  specific 
boundary  conditions  described  below  were  first  developed  and 
analyzed  in  [ 2 ] . 

As  a  prototype  acoustic  problem,  consider  the  following 

in  Q  (1) 

on  r.  (2) 

In  order  to  ensure  that  a  unique  solution  exists,  we  must  also 
impose  a  radiation  condition,  i.e. ,  the  Sommerfeld  condition 
(in  R3) 


Helmholtz  type  problem.  We  have 

Au  +  k^u  =  f 


-iku  +  ||  =  Q(~)  as  r  -  os,  (3) 

which  essentially  allows  only  outgoing  waves.  In  (1)  k  is  the 
given  frequency  and  f  is  a  given  source  which  we  assume  is  of 
compact  support  or  decays  sufficiently  rapidly  as  r  -*  oo .  In 
the  near  field,  (1)  may  be  replaced  by  a  variable  coefficient 
equation,  while  in  the  far  field,  (1)  may  be  replaced  by  more 
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complicated  constant  coefficient  differential  equations  such  as 
a  reduced  convected  wave  equation.  The  boundary  condition  (2) 
may  be  replaced  by  a  Dirichlet  boundary  condition  and/or  mixed 
boundary  conditions. 

Now,  it  is  known  [3]  that  the  solution  of  our  problem  ( 1 ) — ( 3 ) 
may  be  represented  by  the  convergent  expansion 


u 


ikr  oo  F.(S,cp) 
^ -  2  -j - r- 

kr  j=0  (kr)  3 


(4) 


where  8,  cp  represent  spherical  angles.  One  easily  verifies  that 
for  u  given  by  (4) 


-iku  + 


s  ■ 


as  r  -*  co 


(5) 


so  that  (3)  is  certainly  satisfied. 

Now,  our  truncated  problem  is  given  by 


Au  +  k  u  =  f 
m  m 


du 


m  _  ^ 

g 


B  u  =  0 
m  m 


in  Oj, 

on  r 
on  rR 


(6) 

(7) 

(8) 


where  is  an  operator  to  be  defined  below.  The  goal  on  the 

one  hand,  is  to  choose  the  operator  so  that  the  difference 

(u  -  um) ,  i.e.,  the  difference  in  the  exact  solution  of  the  infinite 
domain  problem  and  the  exact  solution  of  the  truncated  problem,  is 
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as  small  as  possible.  On  the  other  hand,  we  wish  to  choose  B 

m 

so  that  the  discretization  of  (6) -(8)  is  straightforward. 

Our  choice  of  boundary  conditions  is  motivated  by  the 
representation  (4) .  Indeed,  consider  the  family  of  operators 


Bm  =  a  (JL-ik  +  ^i)  =  (A - ik  +  b  . 

in  •_1  vdr  r  '  vdr  r  /  m-1, 


(9) 


It  is  easily  verified  by  applying  (9)  to  (4)  that 


B  U 
m 


r=R  =  0(l/R2m+1) . 


(10) 


Thus  we  see  that  by  requiring  that  (8)  hold,  we  are  making  an 
error  of  0(1/R2m+1)  at  the  artificial  boundary  TR  if  the  latter 
contains  the  sphere  of  radius  R.  Thus,  insofar  as  minimizing  the 
error  made  at  the  artificial  boundary,  one  would  like  to  choose  m 
as  large  as  possible.  However,  practical  implementation  considers- 
tions  preclude  choosing  m  >  2.  Basically,  the  difficulty  is  due 
to  the  fact  that  involves  an  m-th  order  partial  differential 

operator  with  respect  to  r.  For  m=l,  this  poses  no  problems; 
for  m>  1,  one  can  use  the  differential  equation  to  eliminate 
all  r-derivatives  of  order  greater  than  one.  However,  this 
procedure  proves  to  be  practical  only  for  the  case  m=2.  See  [2] 
for  details.  For  these  reasons,  we  focus  attention  on  the  operators 

Bi  =  (^-ik  +  ?)  (Hi 

and 

b2  =  (y?-ik+i)-  <12> 
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Note  that  the  Sommerfeld  operator  is  not  a  member  of  our 

family  B. 

m 

We  should  monitor  the  error  in  the  truncated  problem  as  a 
function  of  the  frequency  as  well;  indeed,  the  constants  in  the 
order  relation  (10)  are  functions  of  k.  The  representation  (4) 
suggests  that  the  proper  parameter  is  (kr) ,  although  we  should 
keep  in  mind  that  the  functions  Fj(9,cp)  also  depend  on  k. 
However,  we  have  computational  and  theoretical  evidence  [2]  that 
indeed  as  k  increases,  the  error  in  our  boundary  condition 
decreases.  This  partially  mitigates  the  resolution  problem.  It 
is  well  known  that  as  k  increases,  one  must  decrease  the  grid 
size  h  in  order  to  accurately  resolve  waves,  (kh)  being  the 
relevant  parameter.  However,  as  k  increases,  we  are  able  to 
bring  the  outer  boundary  closer  to  the  inner  boundary,  thus  making 
the  computational  region  smaller. 

In  two  dimensions,  an  analogous  family  of  boundary  operators 
may  be  developed.  They  are  based  on  the  asymptotic  expansion 


u 


ei  (kr-7T  / 2) 


f.  (8) 
2  -1  . 
j=0  r-1 


(13) 


(Actually,  an  exact  representation,  analogous  to  (4)  is  known, 
but  as  it  involves  Hankel  functions,  (13)  is  easier  to  work  with.) 
The  family  of  operators  is  now  given  by 


(14) 
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Also,  for  Poisson  problems  where  k  ~  0  in  (1)  ,  we  may  use  the 
multiple  expansion 


u 


rj 


3 

(in  R  )  to  generate  appropriate  boundary  operators.  These  simply 

turn  out  to  be  given  by  (9)  with  k  =  Q.  Similarly,  we  may  relate 

2 

the  Poisson  problem  in  R  to  (14)  with  k=Q. 

In  [2]  are  collected  many  computational  and  theoretical 
results  concerning  the  accuracy  of  the  solution  of  the  truncated 
problem  (6) -(8).  Note  that  from  (1),  (2),  (6) -(8),  and  (10)  that 
the  difference  e  =  u  -  u  satisfies  the  problem 


The  questions  answered  in  [2]  are  how  does  the  known  error  committed 

at  the  boundary  affect  the  error  in  the  interior  and  at 

the  inner  boundary  T.  Roughly  speaking  it  is  found  that  error  in 

2 

the  interior  is  of  0(1/R  )  for  the  boundary  condition  and  is 

3 

of  Q(l/R  )  for  the  boundary  condition  B2-  Also  in  [2]  is  found 
a  discussion  of  the  implementation  of  the  boundary  conditions 
and  in  conjunction  with  finite  element  discretizations  of 

(6)  -  < 8)  . 
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Although  the  results  of  [2]  are  for  second  order  elliptic 

partial  differential  equations,  the  artificial  boundary  conditions 

described  above  have  been  successfully  used  in  other  settings,  e.g., 

transonic  flow  calculations  with  a  subsonic  free  stream  [4], 

Recently,  artificial  boundary  conditions  have  been  studied 

for  incompressible  viscous  flows  in  exterior  domains  [5].  Here, 

the  governing  equations  are  the  Navier-Stokes  equations.  We 

assume  that  in  the  far  field  the  flow  approaches  a  uniform  flow. 

indeed,  it  is  known  that  the  velocity  u  and  pressure  p  approach 

their  free  stream  values  and  p^  at  the  rates  0(1/R)  and 

0(1/R2),  respectively.  The  following  artificial  boundary  conditions 

have  been  studied.  First,  we  simply  impose  u  =  at  the 

artificial  boundary.  This  is  often  done  in  the  literature?  however, 

we  show  that  this  is  unsatisfactory  from  a  computational  point  of 

view.  Indeed,  the  0(1/R)  error  made  at  the  boundary  PR  can 

lead  to  0(1)  errors  in  the  interiors,  and  at  best,  in  special 

1/2 

situations,  yields  an  0(1/R  )  error  in  the  interior,  A  second 

boundary  condition  is  to  impose  a  zero  stress  boundary  condition 
i  ■  e  •  , 

( p  -  PqJ  n  +  grad  u  *  n  =  0  on  TR .  (15) 

2 

We  show  that  this  boundary  condition  has  an  error  of  0(1/R  ), 

2 

i.e.,  the  actual  stresses  on  TR  are  of  0(1/R  ).  Furthermore, 
we  show  that  the  solution  of  the  truncated  problem  then  differs 
from  that  of  the  infinite  problem  by  0(1/R)  in  the  interior  of 


576 


the  truncated  region.  Although  this  is  a  significant  improvement 
over  the  errors  incurred  by  specifying  the  velocity  on  we 

feel  that  the  results  are  still  not  good  enough  for  practical 
computations.  Indeed,  we  note  that  (15)  plays  the  role  of  the 

2 

Sommer f eld  condition,  i.e. ,  they  both  are  in  an  error  of  0(1/R  ). 
Therefore,  present  work  is  centered  on  developing  boundary  conditions 
of  higher  accuracy,  analogous  to  the  family  Bm  described  above. 

The  starting  point,  analogous  to  the  representation  (4) ,  is  the 
theory  of  hydrodynamic  potentials. 

To  close  this  section,  we  make  some  remarks  concerning  finite 
element  discretizations  of  the  truncated  problem.  Once  we  have 
defined  our  truncated  problem,  e.g. ,  (6)-(8),  it  seems  like  a 
straightforward  task  to  discretize  it  by  a  finite  element  (or 
finite  difference)  scheme.  After  all,  the  truncated  problem  has 
the  appearance  of  a  standard  bounded  domain  problem.  However, 
from  a  practical  point  of  view,  a  naive  discretization  of  a 
problem  such  as  (6)-(8),  e.g.,  using  quasi-uniform  grids,  will 
usually  require  too  many  degrees  of  freedom,  and  thus  lead  to 
an  inefficient  algorithm.  Advantage  must  be  taken  of  the  fact 
that  the  exact  solutions  decay  as  r  increases.  The  way  to  do 
this  is,  of  course,  to  grade  the  mesh,  using  increasing  mesh 
sizes  as  r  increases.  The  apparent  decrease  in  accuracy  caused 
by  the  increasing  mesh  size  is  counteracted  by  the  fact  that  the 
exact  solution,  measured  in  the  norms  appearing  in  the  error 
estimate,  is  decaying.  By  balancing  these  effects,  one  can  easily 
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design  near  optimal  grids  which  lead  to  great  savings  in  computer 
memory  and  processing  time  when  compared  to  a  naive  approach. 

See  [5]  for  a  more  detailed  discussion  of  these  ideas. 


HI.  CHANNEL  PROBLEMS  FOR  VISCOUS  FLOWS.  In  this  section  - 
we  briefly  consider  downstream  boundary  conditions  for  incompressible 

viscous  flows  in  channels.  A  typical 
configuration  is  given  by  the 
accompanying  figure.  Here,  the 
^  boundaries  and  F2  represent 
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solid  walls,  T1  represents  an  inflow 
boundary,  and  FQ  represents  an 
outflow  boundary.  Along  F^ ,  F^ 

and  F2,  boundary  conditions  are  given,  e.g.,  along  F^  and  F2 
the  velocity  vanishes  and  along  FI  the  velocity  is  specified. 
However,  along  Fq  a  boundary  condition  is  not  known  and  therefore 
an  artificial  numerical  boundary  condition  must  be  imposed  in  order 
to  define  a  solvable  discrete  problem.  In  the  absence  of  accurate 
asymptotic  information  at  the  outflow,  one  variously  imposes  some 
derivative  of  the  solution  at  the  outflow  boundary.  In  [6],  we 
carried  out  an  analytical  and  computational  study  of  the  errors 
incurred  by  the  imposition  of  such  artificial  boundary  conditions. 
These  studies  were  carried  out  for  model  linear  transport  equations 
and  for  the  Navier-Stokes  equations.  Four  choices  for  the  artificial 
boundary  conditions  were  considered.  These  involve  specifying 
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(16) 


or 


or 


or 
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d2u 


dx* 


'  R 


0  y 

ss  u  dx  dy 
-d  0 


(17) 


(18) 


(19) 


where  In  the  last  integral,  we  refer  to  the  coordinate  system 
defined  in  the  figure,  and  d  is  a  fixed  number.  The  first  three 
choices  seem  "natural" ,  and  especially  the  second  and  third  have 
been  variously  recommended  in  the  literature.  The  last  choice, 
although  seemingly  "unnatural",  is  of  use  in  certain  geophysical 
problems.  See  [6]  for  details.  In  (16)— (19) ,  u  represents  a 
component  of  the  velocity  field,  or  perhaps  the  stream  function 
or  vorticity  if  a  formulation  involving  those  variables  is  employed. 
The  actual  values  assigned  to  any  of  (16) -(19)  are  either  determined 
by  known  boundary  data  (which  of  course,  should  be  used  whenever 
available) ,  or  is  arbitrarily  set,  e.g. ,  to  zero,  when  unknown. 

The  study  of  [6]  was  aimed  at  determining  the  effect  of  the  latter 
type  specification  on  the  solution  of  the  channel  problem. 

It  was  found  that  the  incorrect  specification  of  any  of  the 
boundary  conditions  (16) -(19)  resulted  in  errors  only  in  a  boundary 
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layer  adjacent  to  the  outflow  boundary  TR.  The  thickness  of 

this  boundary  layer  decreased  with  increasing  Reynolds  numbers. 

See  [6]  for  a  detailed  discussion. 
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ABSTRACT .  A  type  insensitive  scheme  based  on  a  weighted  least  square 
variational  principle  is  used  in  conjunction  with  appropriate  finite  element 
spaces.  Shocks  are  treated  through  a  modified  density  formulation  natural 
to  this  scheme.  Selected  numerical  results  are  reported  along  with  a  qualita¬ 
tive  analysis  of  the  approximations . 

I .  INTRODUCTION .  Finite  element  schemes  based  on  variational  principles 
of  the  least  squares  type  have  a  number  of  advantages  as  well  as  disadvantages 
when  applied  to  compressible  flow  problems.  The  most  striking  advantage  is 
their  insensitivity  to  type;  i.e.,  the  approximations  are  typically  just  as 
accurate  for  supersonic  flows  as  they  are  in  the  subsonic  case.  Moreover , 

in  transonic  flows,  points  in  the  supersonic  region  do  not  require  special 
treatment.  A  closely  related  property  is  ability  of  this  type  of  approach  to 
generate  Hermitian  definite  algebraic  systems. 

The  primary  defect  of  the  least  squares  approach  is  that  their  accuracy 
deteriorates  in  the  presence  of  shocks  or  other  types  of  singularities. 
Moreover,  this  deterioration  is  far  more  severe  than  is  seen  in  other  types 
of  finite  element  or  finite  difference  formulations. 

In  this  paper  we  treat  this  problem  using  suitably  weighted  norms  in  the 
least  squares  formulation.  This  approach  does  require  a  priori  knowledge 
about  certain  qualitative  features  of  the  solution,  but  given  this  knowledge, 
it  along  with  appropriate  mesh  refinement  (or  singular  elements)  can  be  used 
to  accurately  model  singular  behavior. 

II.  WEIGHTED  LEAST  SQUARES  FORMULATIONS.  For  simplicity  we  consider, 
steady  potential  flows.  The  governing  equation  is 

div  p  grad  cp  =  0  in  (1) 

where  cp  is  the  velocity  potential  and  p  is  the  density.  Bernoulli's 
equation  permits  us  to  write  p  as  a  function  of  the  flow  velocity 

v  =  grad  cp.  The  problem  statement  is  completed  by  supplying  conditions  on 
the  boundary  T  of  the  flow  region  fl; 

grad  :p*  V  -  on  T.  (2) 

Here  y  is  the  outer  normal  to  T  and  is  the  given  normal  velocity. 

On  solid  walls  where  there  is  no  normal  flow  we  have  v^  =  0. 

We  reformulate  (1)  in  terms  of  the  mass  flow 

u  =  p  grad  cp.  (3) 
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div  u  =  0 


(4) 


Indeed ,  (1)  is  equivalent  to 


curl(-^)  -  0  (5) 

where  the  density  p  is  now  regarded  as  a  function  of  u.  In  addition  we 
assume  the  boundary  condition  takes  the  form 

u  *  V  =  on  r.  (6) 


for  a  given  normal  mass  flow  u^. 


To  approximate  this  problem  we  select  a  finite  element  space^lf^,  functions  in 

which  must  be  continuous.  We  then  seek  an  approximate  mass  flow  u,  in  If,  . 

h  h 

This  function  is  defined  by  first  applying  Newton's  method  to  (4) -(6).  Indeed, 

-+(0) 

suppose  ll  is  an  initial  guess  to  the  mass  flow,  and  let 


<J(u) 


1 

P(u) 


(7) 


Then  one  step  of  Newton's  method  produces 


=  ^0) 


+  tl 


(8) 


where  the  correction  satisfies  the  linear  problem 

-»  '*(01 
div  Uh  -  -  div  u^UJ 

curl(aQUh)  +  curl  [(grad  Oq  -U^u^0*]  “  -curl (CgU^0* ) 


(9) 

(10) 


-*(0) 

u.  *  v  -  -u:  *  v  on  r.  (id 

h  —  — h  — 

-*(0) 

In  (10),  <JQ  denotes  <J(u^  ).  To  solve  (9) -(10)  we  use  the  least  squares 

method.  In  particular,  we  require  that  U,  minimizes 
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J  {. |  div  [U  +  u 


ft  .  -(0) 
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]  I2wn 


| curl [Qq u 
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+v 


+  (grad  G 


-*  4(0) 

•U)  u  ; 


1  l^} 


(12) 


as  U  ranges  over  all  functions  in  u  satisfying  the  boundary  condition  (11) . 
In  (12),  w^  and  w^  are  weighting  functions  which  vary  with  spatial  locations. 


The  iterations  are  continued  until  the  corrections  are  suitably 

small.  One  attractive  feature  of  this  approach  is  that  typically  few  Newton 
iterations  are^  required.  In  addition,  the  least  squares  formulations  for  the 
corrections  U  lead  to  positive  definite  Hermitian  systems  which  can  be 
readily  solved  by  a  number  of  methods  ([11, [2]).  Finally,  the  approach  is 
not  sensitive  to  the  presence  of  supersonic  region  where  the  equations  are 
hyperbolic. 
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The  approach  does,  however,  have  a  major  drawback.  At  points  where  the 
solution  u  is  singular,  or  in  the  case  of  shocks,  where  it  has  very  large 
gradients,  there  is  a  serious  loss  of  accuracy  in  the  approximation.  This  is 
where  the  weighting  functions  w1/w2  Play  a  major  role. 

First  suppose  the  flow  region  has  a  re-entrant  corner  as  is  shown  in 
Figure  1.  If  the  flow  is  subsonic  at  the  corner  A,  then  it  is  known  that  u 


Figure  1.  Corner  singularity 
-a 

behaves  like  0 (r  )  for  a  suitable  number  a  >  0.  In  this  case  if  the  weights 

a 

Wl/W2  vanis^  A  like  0(r  ),  where  0  >  20.,  then  with  appropriate  mesh 

refinement  full  accuracy  can  be  achieved.  Numerical  experiments  [3]  indicate 
that  £he  correct  weights  are  essential.  Mesh  refinement  alone  is  not  sufficient 

A  similar  situation  exists  with  shocks.  Here  it  has  been  observed  that  it 
is  sufficient  for  the  weights  to  vanish  like  the  first  power  of  the  distance  to 
the  shock.  It  is  also  necessary  to  add  a  small  amount  of  dissipation,  either 
through  the  density  p  or  by  a  dash  pot  term. 

III.  A  NUMERICAL  EXAMPLE.  To  illustrate  the  above  ideas  we  consider  a 
time  periodic  flow  over  an  oscillating  plate  as  shown  in  Figure  2.  Boundary 
conditions  and  other  details  are  given  in  [3] .  An  exact  solution  can  be 


Figure  2.  Oscillating  plate 


obtained  in  the  case  of  a  small  time  periodic  disturbance  of  a  uniform  flow, 
and  we  compare  this  with  the  approximations  obtained  from  the  least  squares 
method.  The  velocity  field  (and  hence  the  mass  flow)  has  a  singularity  like 
-1/2) 

0(r  at  the  tip  of  the  plate  (point  A  in  Figure  2). 


583 


The  pressure  coefficient 


(which  is  proportional  to  the  mass  flow) 


is  plotted  in  Figure  3  as  a  function  of  the  distance  along  the  plate.  The 
solid  line  is  the  exact  solution.  The  circles  represent  the  weighted  least 
squares  approximation  with  mesh  refinement,  while  the  squares  represent  the 
unweighted  least  squares  approximation  on  the  same  grid.  The  serious  loss  of 
accuracy  in  the  latter  is  evident  as  the  singularity  at  A  is  approached. 
Moreover,  additional  mesh  refinement  only  makes  marginal  improvements.  However 
by  adding  the  weights,  which  in  this  case  behave  like  0(r),  there  is  a 
significant  improvement  in  the  accuracy. 

Additional  examples  are  given  in  £3)  including  those  containing  shocks.  . 


Figure  3.  Pressure  coefficient  along  plate 
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ABSTRACT*  Geometric  Programming  (GP)  can  solve  two  types  of 
problems:  non-linear  minimization  formulations  and  simultaneous 
non-linear  equations.  Since  a  large  number  of  problems  in  this  world 
are  non-linear,  there  are  numerous  potential  applications. 
Furthermore,  GP  can  provide  the  general  solution  to  a  given 
non-linear  problem  if  the  problem  is  reduced  to  what  is  called  zero 
degrees  of  difficulty.  Sample  solutions  to  some  problems  are 
provided. 

I.  INTRODUCTION.  Geometric  Programming  is  mathematically 
simple  to  use.  One  does  not  have  to  take  derivatives,  pivot,  or 
transform  into  some  exotic  N-space.  All  that  is  required  is 
high-school  level  algebra  and  a  bit  of  practice.  One  doesn't  even 
need  a  computer,  although  for  larger  problems  it  can  be  useful.  Many 
small  problems  can  be  solved  by  hand  for  the  general  case  without 
even  using  a  calculator. 

If  this  sounds  like  a  sales  pitch,  it  is.  Geometric  Programming 
can  be  a  nice  alternative  to  trying  to  stuff  a  non-linear  problem 
into  a  piecewise-1 inear  Linear  Programming  package,  or  using  a  huge 
Conjugate  Direction  package  on  a  relatively  small  non-linear  problem. 
Furthermore,  GP  doesn’t  run  forever.  When  compared  to  several 
different  non-linear  techniques,  Geometric  Programming  has  tended  to 
be  faster. 

This  is  not  to  imply,  however,  that  Geometric  Programming  is  the 
cure-all  of  every  non-linear  problem,  and  it  generally  doesn't  do 
well  against  linear  problems.  The  restrictions  on  the  uses  of  GP  are 
presented  below,  as  well  as  what  types  of  problems  Geometric 
Programming  really  prefers. 
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II,  Preferences  of  Geometric  Programming.  As  mentioned  above, 
the  two  types  of  problems  which  GP  can  solve  are : 

1)  non-linear  minimization  optimization,  and 

2)  solutions  to  simultaneous  non-linear  equations. 

Only  minimization  optimization  problems  can  be  solved  by  GP,  but  of 
course  one  can  always  transform  a  maximization  problem  into  a 
minimization  problem  by  suitably  transforming  the  objective  function. 
This  transformation  can  be  done  by  either  multiplying  the  objective 
function  by  minus  one,  or  by  inverting  the  objective  function 
(minimizing  the  reciprocal), 

GP  also  prefers  non-zero  solutions.  Since  many  non-linear 
problems  have  multiple  solution  points,  especially  in  simultaneous 
equation  problems,  one  does  not  know  a-priori  which  solution  will  be 
found.  However,  one  can  be  guaranteed  that  if  there  is  any  other 
solution  than  the  zero  solution,  GP  will  find  it.  This  factor  can 
sometimes  be  used  as  an  advantage  when  one  desires  to  find  multiple 
solution  points. 

Another  feature  of  GP  is  that  at  optimality,  all  of  the 
constraints  will  be  "tight".  In  this  context,  ’’tight"  means  that  all 
of  the  inequalities  will  be  equalities.  If  this  is  not  supposed  to 
be  the  case  at  optimality,  then  the  user  must  define  his/her  own 
artificial  variables,  just  as  is  done  in  Linear  Programming. 

Geometric  Programming  also  prefers  that  the  problem  is  solved 
once  in  the  general  case  rather  than  solved  iteratively.  (Don’t  we 
all!)  This  can  be  accomplished  when  the  problem  is  formulated  in  the 
proper  manner,  called  zero  Degrees  of  Difficulty.  This,  too,  will  be 
explained  later  in  more  detail.  The  point  we  wish  to  make  here  is 
that  sensitivity  analysis  is  made  trivially  easy  when  the  general 
solution  to  a  problem  is  known,  and  GP  allows  the  user  to  find  the 
general  solution  without  taking  derivatives. 

Problems  involving  trigonometric  functions  and  powers  to  powers 
cannot  yet  be  solved  by  GP.  Also,  there  is  no  particular  preference 
for  integer  solutions.  Unlike  the  Transshipment  problem  in  Linear 
Programming,  even  all-integer  coefficients  will  not  guarantee 
integer  solutions  in  GP.  There  is  no  way,  yet,  to  force  integer 
solutions  in  Geometric  Programming. 

There  are  a  couple  of  transformations  which  will  allow  the  user 
to  solve  non-linear  problems  which  include  logarithms  and  exponential 
functions.  Since  these  are  two  of  the  more  common  functions  in 
non-linear  problems,  these  transforms  come  in  handy.  We  will  not  go 
into  the. details  here,  but  they  may  be  found  in  Beightler  and 
Phillips. 

The  types  of  problems  which  GP  really  likes  are  those  with  lots 
of  variables  multiplied  together,  with  each  variable  to  a  positive  or 
negative  real  power.  For  example: 
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Minimize  2.45  *  X2'2*  Y1*3  +  3.14  *  X_3,1+  4.02  *  Y--5  (1) 

A  constant  times  a  variable  to  a  real  power  times  another  variables 
to  a  real  power,  plus  another  term  of  the  same  form,  is  the  type  of 
problem  formulation  Geometric  Programming  really  prefers.  Before 
explaining  the  solution  to  the  above  example,  let  us  first  define 
some  terminology. 

III.  Terminology.  The  following  definitions  are  used  in 
Geometric  Programming  in  order  to  simplify  the  discussions  to  follow. 


A  Term:  Any  group  of  variables  separated  by  a  plus  or  minus 

sign.  In  our  example,  there  are  three  terms,  one  involving 
X,  one  involving  Y,  and  one  involving  both  X  and  Y. 

An  Objective  Function:  That  which  is  to  be  minimized.  In  our 
example,  it  is  all  of  equation  (1). 

-A  Constraint:  That  which  must  be  satisfied  while  minimizing. 

There  are  no  constraints  in  our  example,  but  usually  they 
are  of  the  form  of  a  sum  of  terms  less  than  or  equal  to  a 
constant. 

A  Weight  or  Delta:  The  contribution  of  that  term  to  the  final 

solution.  Th  our  example  there  would  be  three  weights — one 
for  each  term.  At  optimality,  the  first  delta  would  be 
equal  to  the  contribution  of  the  first  term  to  the  overall 
value  of  equation  (1).  This  will  turn  out  to  be  a  number 
greater  than  zero  and  less  than  one. 

Degrees  of  Difficulty :  This  is  a  measure  of  how  difficult  it  is 
to  solve"  the  problem  in  its  current  formulation.  This 
measure  is  defined  to  be  equal  to: 

Number  of  Terms  -  Number  of  Variables  -  1 

This  is  similar  to  having  the  same  number  of  equations  as 
unknowns  in  order  to  solve  for  the  variables.  The  Degrees 
of  Difficulty  are  ranked  by  number. 

Zero  Degrees  of  Difficulty  (ODD)  is  good.  One  can  find  the 
general  solution  to  the  problem  if  the  sign  of  all  the  coefficients 
is  positive,  (If  any  are  negative,  you  have  to  check  to  see  if  you 
are  at  a  minimum,  a  maximum,  or  a  saddle  point.)  If  the  Degree  of 
Difficulty  is  one  ( 1DD) ,  one  can  find  the  general  solution,  but  only 
after  taking  logs,  derivatives,  and  exponentiating.  This  is  messy 
and  is  not  usually  done  unless  the  genral  solution  is  required.  Any 
higher  number  for  Degrees  of  Difficulty  requires  that  iterative 
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solutions  be  used  in  order  to  find  the  optimal  solution.  Using 
iterative  solutions  does  not  guarantee  a  global  optimal  solution,  but 
it  usually  works  out  that  way.  Furthermore,  the  number  of  iterations 
required  by  GP  is  small  compared  with  most  other  search  algorithms. 

It  has  been  found  that  GP  performs  at  least  as  well  as  Newton's 
Method  applied  to  a  log  transform  of  the  original  problem,  and  is 
probably  better.  Research  continues  on  this  subject. 

IV.  The  Rules  of  Geometric  Programming.  There  are  four  rules 
in  Geometric  Programming  which  result  from  the  theory  behind  GP. 

These  shorthand  rules  allow  the  user  to  solve  properly  formulated  GP 
problems.  They  are  listed  below: 


1) 


2) 


3) 


4) 


Rule  1  is  the  value  of  the  objective  function  at  optimality, 
represented  by  0*  (phi  star).  The  K's  are  the  coefficients  if  each 
of  the  terms  in  the  whole  problem,  the  Deltas  are  the  weights  for 
each  term.  The  superscripts  and  subscripts  show  that  there  are  N 
terms  in  the  objective  function,  and  JL  terms  in  each  of  the  m 
constraints.  The  deltas  in  the  constraints  have  two  subscripts  to 
represent  the  kth  term  in  the  jth  constraint.  We  shall  show  in  our 
examples  how  to  solve  for  the  weights  or  deltas.  Once  these  deltas 
are  known,  then  the  optimal  solution  is  known  by  solving  for  phi  star 
in  Rule  1,  This  means  that  the  final  solution  is  known  prior  to  the 
values  for  the  variables  are  determined.  Again,  no  pivoting  is 
required. 

Rule  2  restates  the  percentage  rule — each  delta  of  the  objective 
function  will,  at  optimality,  be  equal  to  the  contribution  of  the 
term  with  which  it  is  associated.  If  the  first  term  contributes  507o 
of  the  value  of  equation  (1),  for  example,  then  the  first  delta  will 
equal  one  half.  Since  the  sum  of  the  parts  must  equal  the  whole,  the 
sum  of  the  deltas  of  the  objective  function  must  equal  one.  This 
rule  also  provides  an  extra  equation  with  which  to  solve  for  the 
deltas,  and  is  responsible  for  the  negative  one  in  the  definition  of 
Degrees  of  Difficulty. 

Rule  3  Is  a  restatement  of  the  definition  of  the  deltas — each 
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delta  represents  the  contribution  of  its  term  to  the  whole  objective 
function.  Rule  3  is  used  to  determine  the  optimal  values  of  the 
variables  after  the  optimal  solution  has  been  derived.  In  our 
example,  the  variable  X  may  be  determined  by  using  Rule  3  applied  to 
the  second  term.  Since  all  values  are  known  except  for  X,  one  simply 
solves  a  single  term  monomial  to  determine  X, 

Rule  4  is  similar  to  Rule  3,  except  that  Rule  4  holds  for  deltas 
in  the  constraints  rather  than  in  the  objective  function.  The 
contribution  of  the  kth  term  in  the  jth  constraint  is  equal  to  its 
associated  weight  divided  by  the  sum  of  all  of  the  weights  in  the 
constraint.  This  is  used  whenever  it  is  more  difficult  to  solve  for 
the  variables  using  Rule  3  than  it  is  to  use  Rule  4. 

V.  Solving  For  the  Weights  or  Deltas:  Now  that  we  have 

determined  terminology  and  how  to  solve  the  problem  given  the 
appropriate  weights,  let  us  now  examine  how  to  solve  for  those 
weights.  In  our  example,  we  have  three  terms  and  two  variables,  so 
we  have  a  zero  Degree  of  Difficulty  problem.  Furthermore,  by  Rule  2, 
we  know  that  the  sum  of  the  deltas  must  equal  one.  This  gives  us  our 
first  equation: 


<2^  +  <r2  +  <r3  =i  (2) 

Notice  that  since  the  deltas  are  the  unknowns,  we  will  need  two  more 
equations  in  order  to  solve  for  the  deltas  explicitly. 

Next,  we  look  to  see  in  which  terms  the  variable  X  appears.  If 
X  does  not  appear  in  the  term,  then  the , coefficient  for  the  delta  in 
our  next  equation  is  zero.  If  X  does  appear  in  the  term,  then  the 
coefficient  of  the  delta  is  equal  to  the  exponent  of  the  variable  X. 
In  our  example,  ' 


2.2  *  d"1  -  3.1  *  ar2  =0  (3) 

and  the  right  hand  side  is  defined  to  be  equal  to  zero. 

The  same  thing  is  done  for  Y  as  was  done  for  X.  In  our  example, 


1.3  *  <T1  .  5  *  d~3  =  0  (4) 

which  completes  the  number  of  required  equations  with  which  to  solve 
for  the  deltas.  Notice  that  even  though  the  problem  as  originally 
defined  was  non-linear,  the  solution  to  the  problem  of  finding  the 
weights  for  each  term  is  a  linear  problem.  Furthermore,  no 
derivatives  had  to  be  taken  an dT no  pivoting  was  required. 

The  values  of  the  deltas  in  the  above  problem  are: 

=  .232036,  <T2  -  *16^67>  and  cT^  =  -60329 

From  Rule  1  we  find  that  the  optimal  solution  equals  8.81597,  and  by 
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applying  Rule  3  to  the  second  and  third  terms,  X  equals  1.28256  and  Y 
=  .57129.  All  that  was  used  to  solve  this  problem  was  a  little  paper 
and  a  hand  calculator.  The  solution  was  verified  by  substituting  the 
optimal  values  for  X  and  Y  back  into  the  original  equation,  and 
seeing  if  that  number  matched  the  solution  given  by  GP.  One  can  also 
assure  that  the  solution  is  a  minimum  by  changing  the  value  of  X  or  Y 
slightly  in  equation  (1)  and  showing  that  the  result  is  an  increase 
in  the  value  of  the  objective  function. 

VI.  An  Example  Comparing  Lagrange  Method  to  GP:  In  the 

following  example,  we  "will  tirst  solve  the  problem  by  means  of  the 
Lagrange  Method  which  requires  the  taking  of  three  partial 
derivatives  to  find  the  optimal  value.  Then  we  will  show,  how  the 
same  solution  can  be  accomplished  through  Geometric  Programming. 

The  problem  is  to  minimize  the  amount,  and  therefore  the  cost, 
of  material  required  to  build  a  cylindrical  storage  tank  which  is 
constrained  to  contained  at  least  lOOOtf  cubic  yards  of  liquid.  The 
pi  is  obviously  there  to  make  the  solution  come  out  as  nice  round 
numbers.  The  problem  statement  and  results  are  as  follows: 


MIN  $  »  $1TR2  +  $21TRh 

s.t.  TTR2h  >  1000 IT  CUBIC  YARDS 

L(h,R,  A)  3  F(H,R)  -  A(g(h,R)-Ki) 

r3  =  1000  R*  -  10  h*  -  10 

COST  =  trR2+2TrRH  =  $300  IT  =  $942 

In  order  to  find  the  solution,  the  Lagrange  (  L(H,R,  X  )  )  was 
required  to  have  the  partial  derivative  taken  with  respect  to  R,  H, 
and  X  .  These  partials  were  set  equal  to  zero,  and  the  resulting 
three  equations  and  three  unknowns  (  R,  H,  X  )  were  solved.  The 
results  were  that  the  radius  and  the  height  of  the  cylinder  (with  no 
bottom — the  bottom  is  a  sunk  cost)  are  both  equal  to  10  yards.  The 
total  cost  of  the  operation  was  $942. 

In  order  to  solve  the  problem  by  Geometric  Programming,  one  must 
first  get  the  problem  in  the  proper  form.  All  of  the  constraints 
must  be  of  the  form  where  all  of  the  terms  are  less  than  or  equal  to 
one,  as  shown  below. 
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MIN  I  -  t  TrR2  ♦  $2  n-RH 

s.T.  1  >  1000  fl-  R'2  H'1 

rr 


^*4- r  ?s**as  ss^^rca'Sfss  kss.  \l 

SiiTLS  3eiS/SrSfi  If* l>  *"d  £lnd  ch*  val"*s  £« » » 

MIN  $  =  $  it  r2  +  $2  rr  Rh 
s.T.  1  2  1000  rr  R'2  H'1 


s\  +  <r2  =  l 

R  :  2^  +  ^2  *  ■  0 

H  :  0«*X  +  <*2  *  ^  =  0 


i i  a  1/3  S2  *  2/3  ^  =  2/3 

(^‘=  (3H),/3(3ir}2/3(l0Q0)2/3  =  3IT100  =  J9R2 


•n  Simultaneous  Equation  Problems:  The  next  exanmle  uill 

illustrate  how  to  solve  tor  Cwo  variables  in  two  non-linear 
equations.  We  will  use  the  method  created  by  Baker  3  which  ensures 
that  all  of  the  coefficients  will  be  Dositive  Tho  enS^r?Si. 

Aliens  allows  for  negative  coefficient!  and^omSimes  fSds^a  ^ 

SeUetoSethetUfo  that^roblims  ?°lu5io"  Poin!=-  The  two  methods  work 
the  othlr  hal’no  prSbLtS  aU.f  WhlCh  ^  haS  di£fi^lty  solving, 
The  problem  Is  stated  as: 


*1*2 


9 


3Xx2X3  -  X23  -  4 

which  is  placed  in  proper  format  by  solving  for  the 
to  be  equal  to  one. 


right  hand  side 


5  xl2  +  I  xiV 


1 


4  > 


1 
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The  calculations  are  provided  in  Appendix  A.  In  summary,  one  first 
performs  the  operation  known  as  "Condensing”  on  the  two  equations, 
and  seperately  on  the  numerator  and  denominator  if  the  equation  is  a 
ratio.  To  condense,  one  simply  chooses  a  value  for  each  of  the  two 
variables  (guess  a  solution)  and  determine  how  much  each  of  the 
variables  contribute  to  each  equation  at  these  chosen  values.  These 
are  used  as  weights  in  order  to  solve  for  the  variables.  These  new 
values  for  the  variables  are  then  substituted  back  into  the  original 
equations,  and  new  weights  are  determined.  These  weights,  in  turn, 
are  used  to  solve  for  new  values  for  the  variables  and  the  pattern 
keeps  repeating  until  the  change  from  one  iteration  to  the  next  is  as 
small  as  desired. 

The  advantage  is  that  the  original  guess  by  the  user  does  not 
have  to  be  accurate  at  all.  One  can  choose  a  value  of  1000  for  the 
first  variable,  and  in  one  iteration,  the  value  is  of  the  order  of 
magnitude  (one)  of  the  optimal  solution.  One  solution  to  this 
problem  is: 


»  1.33636 
X2  =  1.75424 


VIII.  A  Final  Example:  As  a  final  example,  we  would  like  to 

present  a  problem  presented  to  us  by  an  analyst  at  the  Concepts 
Analysis  Agency,  The  problem  was  to  create  an  allocation  scheme  by 
which  aircraft  could  be  allocated  to  either  near  or  far  targets  to 
best  reduce  the  enemy's  capabilities.  The  key  restriction  was  that 
whatever  scheme  was  used  should  be  flexible  enough  to  adapt  to 
anyone's  rational  idea  of  how  they  would  perform  the  allocation 
scheme.  This  was  necessary  since  not  only  would  analysts  differ  as 
to  which  scheme  should  be  used,  but  also  the  commanders  who  were 
being  modelled  would  also  be  likely  to  differ  in  their  allocation 
schemes.  Therefore,  a  truly  flexible  scheme  was  required. 

The  following  formulation  was  proposed.  Let  Ki  be  the  enemy 
force  near  to  friendly  forces,  and  let  K2  represent  the  forces  farther 
away.  The  second  number  could  be  discounted  over  time  or  distance  or 
both.  Let  Nj  represent  the  number  of  aircraft  allocated  to  near 
targets,  and  N2  represent  the  number  of  aircraft  allocated  to  far 
targets.  The  sum  of  the  aircraft  must  be  less  than  or  equal  to  the 
total  number  of  aircraft  available,  N.  The  objective  is  to  minimize 
the  total  amount  of  (discounted)  force  facing  the  friendly  forces, 
and  this  is  represented  by  decreasing  the  enemy  force  by  the  number 
of  aircraft  allocated  to  some  power  -a.  We  assume  for  a  moment  that 
enemy  air  defense  threat  is  the  same,  that  <*  is  the  same  for  both 
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near  and  far  targets,  and  no  aircraft  need  be  held  in  reserve. 
The  formulation  and  solution  are  given  below: 


MIN 

“A* 

+ 

*/2a 

S.T. 

Nl  +  N2 

< 

N 

1 

LET  B  - 

(*2  ' 

\  i  +a 

K1  / 


=  _ N _ 

1  +  b 


N/  =  NR 

1  +  B 


Notice  that  since  there  are  four  terms  and  only  two  variables 
this  is  a  one  Degree  of  Difficulty  problem.  As  mentioned  previously 
logarithms  and  derivatives  need  to  be  taken.  Once  again,  we  will  not 
go  into  the  gory  detail  here.  The  important  point  is  to  examine  the 
sensitivity  of  the  solution  and  its  formulation.  Let  alpha  equal  0. 
This  results  in  the  standard  "weighted”  allocation  scheme  where  the 
portion  allocated  to  each  target  is  same  as  the  proportion  of  each 
target  to  the  total  threat. 

.  ^  we  were  to  let  alpha  equal  1,  we  would  have  a  "modified” 

weighting  scheme,  where  the  portion  allocated  to  each  target  is  a 
function  of  the  magnitude  of  each  threat.  Similarly  for  higher 
values  of  alpha.  If,  instead,  we  were  to  let  alpha  equal  -1,  then 
all  aircraft  would  be  allocated  to  whichever  threat  was  larger.  This 
is  consistent  with  the  fact  that  when  alpha  equals  -1,  the  original 
problem  is  linear.  The  solution  to  a  linear  problem  with  two 
variables  is  always  to  find  the  "corner"  solution  point;  that  is,  to 
allocate  everything  to  the  varaible  with  the  highest  payoff. 

Fmally ,  if  we  were  to  let  alpha  equal  infinity,  then  half  of 
the  aircraft  would  go  to  the  near  target,  and  half  to  the  far  target 
no  matter  what  the  values  of  the  coefficients.  These  four  results  ’ 
are  summarized  below. 


[F  (l  =  0;  =  N  *1  ,  N2  =  K2 

K1  *  *2  K}  +  k2 


IF  Cl  =  1;  =  N 


VicT 


K1  +Vr^2~ 
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IF  a 


=  -1;  -  0,  r*2  =  f*  IF  k2  >  K1 

=  \i*  ft i  =  0  if  «2  >  k2 

IF  a  =oo;  Hi  -  I  j  rf2  35  4 

2  2 


The  advantage  to  the  above  allocation  scheme  is  that  it  is 
flexible  enough  to  contain  all  of  the  "standard”  allocation  schemes 
used — a  weighted  scheme,  all  or  nothing,  or  half  and  half. 

Furthermore,  by  having  solved  for  the  general  solution,  one  can 
calculate  the  optimal  allocation  given  alpha  without  having  to 
re-derive  the  problem  each  time  alpha  is  changed.  This  also  allows 
alpha  to  change  value  during  the  operation  of  the  simulation  without 
large  blocks  of  code  being  required  for  each  scheme. 

IX.  Conclusions:  "If  the  world  were  linear,  Dantzig  would  be 

king.71  This  quote  is  based  upon  the  fact  that  Dantzig  was  chosen  to 
compile  what  was  known  about  Linear  Programming  into  the  classic  book 
on  the  subject.  Solutions  to  linear  problems  are  relatively  easy  to 
obtain  with  numerous  packages  available.  However,  the  world  is  not 
linear,  and  a  large  portion  of  the  problems  in  this  world  do  not  fit 
well  into  linear  programs.  One  can  theoretically  always  turn  a 
non-linear  problem  into  a  very,  very  large  piecewise  linear  problem, 
but  this  is  awkward.  There  exist  a  number  of  non-linear  packages 
which  occupy  large  sections  of  core  when  running,  and  sometimes  run 
forever. 

Geometric  Programming  provides  a  nice  alternative  to  solving 
non-linear  problems,  especially  ones  which  include  a  managable  number 
of  variables  (say  less  than  twenty)  and  whose  exponents  are  not  all 
integers.  Larger  problems  are,  of  course,  solvable  by  Geometric 
Programming,  but  one  should  then  examine  the  GP  computer  packages 
available,  or  consider  writing  a  program  to  solve  that  unique 
problem. 
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APPENDIX  A:  SAMPLE  SOLUTION  TO  A  SIMULTANEOUS 
NON-LINEAR  EQUATION  PROBLEM 


The  problem  as  given  stated: 


Xl2  +  HX23  =  9 

3X12x2  "  x23  =  4 

The  condensed  formulation  of  the  problem  is  obtained  as 
follows: 


1 

9 


+ 


1  V  V  3 

9  X1X2 


1 


4 


+  X, 


2 

3 


1 


So  condensing  yields: 


<|)  (w 


11 


w 


12 


3X1  X2 


(^_)W21  V  W22 


-  1. 


kw 


21 


’22 


1 
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Taking  logs  and  setting  up  the  matrix  form 
gives : 


of  the  system 


(2W11  +  Wi2>  <3W„) 

(2)  (] 


12' 

' 

22‘. 


~ln  (X1)' 

1 

In  (X2) 

’12 


L-^  (3(^)W21  (H22,W22, 


Using  a  starting  point  of 


X1  -  50,  X2  =  20,  and  letting 


*1  -  <5>  <<£->"n  <^)”12 


and 


W2 1  W?  1  W,  , 

K2  =  3<-§i)  (W22)  22 


2  w  v— 4 

results  in: 


X, 


"U  ‘  Xl!  *  *1*23  " 
=  0.00621 


2500 


2500  +  400000 


w 


X1X2 


400000 


12  x^2  +  x  X  3  2500  +  400000 

=  0.99379 


W 


21  4  +  X  3  4  +  8000 

2 

=  0.00050 
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8000 


w. 


22 


4  +  x  ^  ^  +  8000 


=  0.99950 


0.00621 


K.  =  (i)  ( - L  '  ~ . .  ,  i  0.99379 

1  9  0.00621  (TO37?) 


=  0.11538 


K2  =  (3)  (0^05/- 0005 


(0.9995)  °'9"5 


Solving  th,  system  (2. 31J  for  fs^.  and  yUUa 

In  (X1)  =  0.26179 

Ln  (X2)  =  0.68014 
so 

=  1.29925 
X2  =  1.97415. 


The  process  is  then  repeated  by  calculating  new  weights 


and  continuing  as  before. 

provided  below: 

A  summary  of 

the  iterations 

Iteration 

xi 

x2 

0 

50.0000 

20.0000 

1 

1.2992 

1.9742 

2 

1.3304 

1.7645 

3 

1.3363 

1.7543 

4 

1.3363 

1.7542 
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ABSTRACT 

Dr.  Nazir  Warsi,  in  recent  work,  showed  how  to  solve  certain 
dynamic  programming  problems  while  keeping  strict  bounds  on  the 
amount  of  working  storage  needed.  We  discuss  extensions  of  Dr. 
Warsi's  methods  and  analysis  to  more  general  dynamic  programming 
networks.  We  describe  a  general  algorithm  for  solving  problems  in 
this  more  general  class.  This  algorithm  may  be  applied  in  such  a 
way  as  to  limit  working  storage  arrays  to  any  dimensions  greater 
than  or  equal  to  2.  In  making  this  restriction,  there  are  two 
costs:  a  number  of  arrays  of  dimension  2  may  need  to  be  stored 
simultaneously;  searches  for  maxima  can  become  arbitrarily 
complex  with  the  complexity  of  the  network. 

We  discuss  the  implementation  of  the  general  algorithm  in  a 
higher  level  language  with  particular  emphasis  on  storage  management. 
We  also  discuss  data  representations  and  the  practicality  of  imple¬ 
menting  a  system  for  handling  general  networks. 


The  storage  and  computational  requirements  of  dynamic  programm¬ 
ing  algorithms  have  limited  their  practical  uses.  The  storage  and 
computational  demands  can  become  excessive  whenever  the  state  vec¬ 
tors  are  of  dimension  3  or  more,  or  if  the  number  of  states  grow 
exponentially.  The  purpose  of  the  research  described  in  this  paper 
is  to  implement  dynamic  programming  solutions  for  certain  basic 
kinds  of  networks  in  a  manner  that  will  keep  the  working  storage 
needs  to  a  minimum  with  a  possible  trade-off  in  increased  computing 
time. 


The  types  of  networks  considered  in  this  study  are  converging 
branch,  diverging  branch,  feed-forward  loop,  and  feed-back  loop. 
In  this  note,  we  will  present  the  implementation  and  analysis  of 
alsorithms  for  the  feed-forward  loop  system.  For  further  informa¬ 
tion,  refer  to  the  reports  of  Warsi  and  Esogbue  in  the  references. 
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k  X 


Feedforward  Loop  System  Diagram 


Each  node  in  a  dynamic  programming  network  represents  a  deci¬ 
sion  point.  The  decision  is  a  finite  valued  variahle,  values 
taken  to  be  integers  1  ..  pj  for  node  i.  Edges  connecting  nodes 
are  directed  indicating  a  flow  from  one  node  to  another.  The 
edges  may  carry  one  of  a  finite  number  of  state  values,  taken  to 
be  integers  from  1  ..  k-j  for  edge  i.  For  each  node,  functions 
are  defined  determining  the  output  states  (on  edges  leaving  the 
node)  and  a  return  (a  numeric  value).  The  function  values  depend 
on  input  states  and  on  the  decision  value  for  that  node.  The 
solution  of  a  network  consists  of  a  choice  of  input  state  for  each 
edge  with  external  source  and  a  choice  of  decision  for  each  node. 
Further,  these  choices  will  maximize  the  sum  of  the  returns  for 
all  nodes  in  the  network  over  all  possible  such  choices. 

The  output  state  functions  and  the  return  functions!  can  be 
represented  by  matrices,  or  higher  dimension  analogues.  Vie  will 
use  rj  to  denote  the  return  function  (array)  for  node  i  and  tj  or 
t'i  to  denote  the  output  functions  (array)  for  node  i.  Note  that 
t'  will  only  be  defined  for  nodes  with  two  output  edges. 

The  strategy  for  solving  a  network  is  to  iteratively  combine 
return  functions  for  nodes  into  functions  representing  the  total 
return  possible  from  groups  of  nodes.  The  value  of  such  a  combined 
return  function  will  depend  only  on  state  values  of  edges  that 
enter  the  represented  group  of  nodes  from  outside  the  yroup  or  that 
leave  the  represented  group.  In  particular,  the  value  will  not 
depend  on  state  values  on  edges  between  nodes  within  the  yroup. 

for  the  feed-forward  loop  network,  a  starting  function  is 
computed  which  represents  the  maximal  return  from  node  1  given  the 
input  xi.  This  function  is  combined  with  returns  from  nodes  back 
to  node  s  -.1.  During  all  of  this,  the  function  can  be  represented 
by  a  one-dimensional  array.  Upon  combining  with  the  return  from 
node  s,  the  function*  depends  on  two  arguments  (xs_i  and  xq]),  and 
is  represented  by  a  two-dimensional  array.  Returns  from  nodes  on 
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the  parallel  portions  of  the  network  are  then  combined  until  node  j 
is  one-dimensional  array.  Further  iterations  produces  a  function 
representing  the  group  of  all  nodes  in  the  network,  return  depending 
on  xn,  Maximizing  over  xn  produces  the  network  maximal  return. 

Analysis  of  the  work  done  to  this  point  shows  that  the  working 
storage  needed  is  simply  the  representation  of  the  current  combined 
function.  If  k  is  the  maximal  number  of  states  on  any  edge,  this 
amount  is  proportional  to  k2,  since  a  two-dimensional  array  was 
needed.  However,  even  though  the  maximal  total  return  from  the 
network  is  known,  and  the  free  input  xn  is  known,  the  decisions 
necessary  at  each  node  are  not.  Approaches  to  determining  these 
decisions  fall  on  a  spectrum  between  these  two  extremes:  store  for 
each  node,  as  its  return  is  entered  into  the  combined  function,  the 
functional  dependance  of  the  decision  made  on  the  input  states  to 
the  group;  repeat  the  optimization  computation  above  retaining 
functional  information  about  the  decision  in  only  the  last  node  pro¬ 
cessed.  The  latter  approach  works  since  the  optimization  computa¬ 
tion  is  carried  out  for  one  less  node  at  each  stage.  The  former 
approach  requires  a  large  amount  of  storage,  depending  on  the  number 
of  nodes  in  the  network,  among  other  variables.  The  latter  approach 
requires  a  large  amount  of  time,  but  only  the  working  storage  needed 
for  the  optimization  computation  (proportional  to  k*). 

One  of  us  (Warsi,  loc.  cit.)  has  developed  a  variation  on  the 
recomputation  approach  that  cuts  down  the  number  of  recomputations 
needed  considerably.  In  this  method,  optimal  decisions  are  stored 
as  functions  of  the  group  input  states  if  all  such  states  are  on 
edges  to  the  node  being  entered  into  the  combined  function.  Extra 
storage  is  not  needed  in  this  case,  as  the  (or  t-j 1 )  matrix  can 
be  used  by  marking  appropriate  entries.  For  example,  if  there  is 
one  input  edge  and  one  output  edge,  and  the  combined  function  being 
computed  depends  only  on  the  state  value  on  the  input  edge,  then 
the  marking  method  is  as  follows.  For  each  input  state,  mark  that 
entry  in  the  corresponding  state  row  of  tj  that  is  in  the  column  of 
the  optimal  decision  for  that  input  state.  Since  the  entries  in 
the  t  matrix  are  no  larger  than  k  (the  maximal  value  of  any  state), 
this  marking  can  be  done  by  adding  k  to  the  entry.  Doing  this,  the 
recomputations  can  be  cut  down  to  four,  regardless  of  the  number 
of  nodes  in  the  network.  Similar  savings  in  time  and  space  are 
achieved  for  the  other  network  types  by  using  this  method  of  saving 
decisions  with  limited  recomputation. 

The  most  interesting  problems  that  arise  in  actually  implement¬ 
ing  the  algorithms  developed  for  the  non-serial  networks  is  the 
representing  of  the  arrays  in  storage.  We  wished  to  represent  the 
network  and  r  and  t  functions  in  a  uniform  way  that  could  be  operated 
on  whatever  the  distribution  of  kj  and  pj .  Although  PASCAL,  the 
language  we  chose  for  implementation,  does  allow  dynamic  allocation, 
array  sizes  must  be  specified  at  compile  time.  (This  will  vary  for 
different  versions  of  PASCAL).  We,  therefore,  chose  (influenced  by 


our  FORTRAN  background)  to  use  one  large  one-dimensional  array  to 
store  all  information  in.  One  extra  array  is  used  to  point  to  the 
beginning  of  the  information  for  each  node  in  the  big  array. 
Various  selector  and  set  functions  can  then  be  written  to  access 
the  information.  If  the  algorithms  are  moved  to  a  different  lan¬ 
guage  (ADA,  for  example),  the  various  arrays  could  then  be  allo¬ 
cated  separately,  using  pointers  to  link  information  together.  Due 
to  the  sequential  nature  of  the  processing,  double  links  between 
nodes  would  suffice  to  move  around  as  needed  within  the  network. 

We  have  made  some  investigation  of  more  complex  networks. 
Work  done  by  Onukwuli  (see  reference)  had  quantified  some  of  the 
problems  in  the  main  computation  that  arise  from  attempting  to  keep 
the  combined  function  of  low  dimension.  It  also  appears  that,  in 
non-serial  networks,  that  there  are  certain  key  points  at  which 
combined  functions  can  be  saved  for  future  reference  that  will  cut 
down  on  recomputation  time.  We  hope  that  this  will  lead  to  reason¬ 
ably  space  efficient  methods  for  solving  very  complex  dynamic 
programming  networks. 
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MATHEMATICAL  ANALYSIS  OP  THE  COUNTERFIRE  DUEL; 
TANKS  VS.  ANTI-TANK  MUNITIONS 

Joseph  V.  Michalowicz 
Tactical  Nuclear  Warfare  Branch 
Harry  Diamond  Laboratories/  USA  ERADCOM 
Adelphi,  MD  20  783 


ABSTRACT.  A  detailed,  analytic  model  is  developed  to  represent  the  duel 
between  a  ground  laser  designator  (GLD)  directing  a  sequence  of  laser-guided 
rounds  against  a  platoon  of  target  tanks  which  counterfire  against  the  GLD. 
The  model  accurately  portrays  the  complex  interplay  between  the  designator-on 
time,  the  rate  of  fire  of  the  laser-guided  rounds  and  the  tank  counterfire 
response  time  distributions.  Also  taken  into  account  are  the  tank  aiming 
errors  and  range  estimating  techniques,  the  level  of  GLD  protection,  flight 
times,  designation  modes  and  degree  of  coordination  of  the  tank  platoon. 

One  of  the  innovations  of  this  model  is  the  utilization  of  gamma 
distributions  to  represent  tank  counterfire  response  times.  This  permits  the 
representation  of  the  time-to-fire  for  any  number  of  tank  rounds  within  the 
designator-on  time  interval  by  convolution  of  the  gamma  density  functions. 
Exact  expressions  which  allow  for  all  encounter  outcomes  are  derived  for 
confutation  of  the  expected  number  of  tanks  killed  and  probability  of  GLD 
kill. 


This  model  was  used  to  determine  the  probabilistic  outcomes  of  the 
encounters  at  each  stage  of  the  force -on -force  analysis  in  the  definitive  U.S. 
Any  study  on  GLD  survivability. 

I .  INTRODUCTION .  The  development  of  laser-guided  missiles  and 
projectiles  has  provided  a  new  dimension  to  the  battlefield  environment  for 
tanks.  From  a  remote  ground  or  airborne  forward  observer  (FO)  position,  a 
laser  designator  operator  illuminates  a  target  tank  with  a  directed  laser  beam 
(see  fig.  1).  The  laser  energy  reflected  from  this  spot  on  the  target  then 
enables  the  seeker  to  guide  the  missile/projectile  to  the  target  tank.  If 
able  to  detect  the  presence  of  the  laser  designation,  the  tank  (and  its 
support  units)  may  use  various  countermeasures  (CM),  such  as  taking  evasive 
action,  using  smoke  or  chaff  to  disguise  the  tank*s  position,  generating  false 
target  images  to  deceive  the  missile  seeker,  dazzling  the  FO  with  flashlamps, 
and/or  directing  counterfire  against  the  FO,  the  missile  launching  platform, 
or  the  missile  itself.  This  report  presents  a  rigorous  analysis  of  the  Tank 
Counterfire  Duel,  in  which  laser-guided  missiles  are  directed  against  a  tank 
or  tank  platoon  via  a  ground  laser  designator  (GLD),  and  the  tanks  detect  the 
laser  radiation  and  counter  by  firing  their  main  guns  in  an  effort  to  destroy 
the  GLD. 
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Figure  1.  Laser-guided  missile  versus  tank  duel. 


The  analytic  model  which  will  be  derived  takes  into  account  tank 
counterfire  response  time  distributions,  designator-on  times,  tank-to-GLD 
range,  missile/pro jectile  single-shot  kill  probabilities  (SSKP) ,  tank  round 
SSKP  against  the  GLD,  tank  fire  control  errors,  GLD  protection,  ground  slope 
at  the  GLD,  missile  and  projectile  flight  times  and  rates  of  fire,  tank  round 
flight  times,  and  degrees  of  coordination  of  the  tank  platoon*  Special 
features  which  make  this  model  unique  include  the  following! 

1*  Probability  distributions  are  derived  to  represent  tank  times -to- 
fire*  In  particular,  gamma  density  functions  are  fit  to  the  times-to- 
first-fire  and  time-between-fires  data.  The  time-to-nth-fire  distribution 
is  then  obtained  by  convolution  of  the  time-to-f irst-f ire  density  with  n  - 
1  copies  of  the  time -between-f ires  density. 

2.  Any  mmiber  of  tank  fires  are  allowed  during  the  designator-on 
time  interval. 

3.  Flight  time  of  the  tank  round  is  included. 

4.  Various  GLD  positions  and  protection  levels  are  considered. 

5.  Analytic  expressions  which  incorporate  the  time-to-f ire 
distributions  and  allow  for  all  encounter  outcomes  are  formulated  for 
confutation  of  the  expected  msnber  of  tanks  killed  and  probability  of  GLD 
kill*  This  is  not  a  simulation  model! 

6.  In  addition  to  the  one  round  versus  one  tank  duel,  the  encounter 
between  three  laser -guided  rounds  versus  a  platoon  of  three  tanks  is 
analyzed,  with  attention  paid  to  the  level  of  coordination  of  the  tanks. 

The  data  base  used  for  the  model  was  derived  from  the  totality  of  U.S* 

Army  field  epq>eriments  on  tank  counterfire  response  and  GLD  kill.  Most  of 
these  data  cannot  be  detailed  in  this  paper  due  to  classification,  but  the 
form  of  the  data  will  be  discussed.  GLD  suppression,  that  is,  degradation  in 
GLD  crew  performance  due  to  counterfire  near-misses  and  obscuration,  could  not 
be  included  due  to  a  lack  of  proper  experimental  data,  even  though  the  model 
could  be  adapted  to  handle  suppression. 

Because  of  Congressional  concern  over  the  survivability  of  ground  laser 
designators  on  the  battlefield,  a  special  task-  force,  the  Survivability  Study 
Task  Force  for  Ground  Laser  Designators  (SSTF),  was  established  to  answer  the 
survivability  question  once  and  for  all.  The  SSTF  performed  a  two-stage 
analysis  of  the  physical  survivability  of  the  GLD.  First,  the  model  described 
in  this  paper  was  used  to  provide  a  detailed  analysis  of  the  outcomes  of 
counterfire  duels  played  under  a  wide  variety  of  conditions.  These  duels  were 
then  placed  in  a  realistic  battlefield  context  as  the  fixed-piece  engagements 
occurring  in  a  force-on-force  map  exercise  pitting  one  Blue  conpany  on  the 
defense  against  a  Red  tank  threat.  The  resulting  battlefield  engagement 
assessment1  provided  a  thorough  and  credible  answer  to  the  GLD  survivability 
question,  which  has  been  cited  frequently  in  Congressional  testimony.  In 
addition  to  the  duel  between  one  laser-guided  round  and  one  tank  (1-on-1)  and 
the  encounter  between  three  rounds  fired  in  sequence  and  a  platoon  of  three 
tanks  (3-on-3)  discussed  in  this  paper,  models  for  other  encounter 
combinations  were  derived2  for  use  in  the  SSTF  map  exercise. 

*Thomas  J.” Gleason,  Joseph  V.  Michalowicz,  Morgan  G.  Smith,  and  Richard 
Scungio,  Final  Report  —  Survivability  Study  Task  Force  for  Ground  Laser 
Designators,  Harry  Diamond  Laboratories,  HDL-TR-1860  {December  1982). 

^Joseph  V*  Michalowicz,  Analysis  of  the  Laser-Guided  Missile/Projectile  versus 
Tank  Counterfire  Duel,  Harry  Diamond  Laboratories,  HDL-TR-1854  (May  1978). 
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III.  PROBLEM  VARIABLES*  The  essence  of  the  encounter  situation  is  best 
understood  from  the  following  parameters  essential  to  the  analysis. 

Designator-tank  range.  For  survivability  reasons,  the  operator  of 
the  laser  designator  would  prefer  to  operate  at  as  great  a  distance  from  the 
tanks  as  possible,  because  the  greater  the  range  the  less  effective  are  the 
tank  rounds  counterfired  against  the  GLD.  However,  there  is  an  upper  limit  on 
this  range  imposed  by  the  ability  to  hold  the  laser  spot  on  the  tank, 
particularly  when  the  tank  is  moving.  Also  terrain  conditions  will  often 
determine  when  the  on-coming  tank  can  first  be  designated.  Consequently,  a 
number  of  different  designator-to-tank  ranges  are  treated  in  this  study. 

Mlssile/proiectile  launch  range.  The  flight  time  of  the  missile  may 
affect  the  length  of  time  the  GLD  must  designate  the  target,  if  the  target  is 
illuminated  for  the  entire  missile  flight.  This  is  not  the  case  for 
projectiles  where  laser  designation  is  required  only  for  the  terminal  part  of 
the  trajectory. 

Tank  response  time.  The  time  from  laser  alarm  until  the  tank  fires 
its  first  round,  then  a  second  round,  then  a  third,  etc,  is  one  of  the  two 
critical  time  factors  in  the  tank  counterfire  duel.  The  longer  the  tank 
response  time,  the  more  survivable  the  GLD.  Time-to-first-fire  and  time- 
between-f ires  have  been  measured  in  various  field  tests,  and  probability 
distributions  are  fit  to  these  data  as  the  model  is  developed. 

Deslgnator*on  time.  The  other  critical  time  factor  is  the  length  of 
time  that  the  target  tank  is  illuminated  by  the  GLD.  The  tradeoff  between 
this  time  and  the  tank  response  time  is  the  essence  of  the  Counterfire  Duel, 
for  long  designator-on  times  decrease  the  survivability  of  the  GLD.  The 
designator-on  time  may  be  reduced  by  special  techniques  such  as  offset 
designation,  in  which  the  GLD  beams  on  a  nearby  object  which  reflects  laser 
energy  but  does  not  trigger  the  tank's  laser  alarm,  then  switches  to  the  tank 
in  the  final  critical  guidance  phase  of  the  weapon  trajectory. 

Single-shot  kill  probability  (SSKP)  for  the  laser-guided  round 
against  the  tank.  This  parameter  is  of  obvious  importance  and  varies  from  one 
laser-guided  weapon  to  another. 

SSKP  for  tank  round  against  laser  designator.  This  parameter  depends 
on  the  type  of  tank  round  fired,  the  tank-to-GLD  range,  the  hardness  of  the  FO 
position  (e.g. ,  in  a  foxhole,  a  bunker,  or  a  forward  observer  vehicle  (FOV)), 
and  the  ground  slope  at  the  GLD.  The  SSKP  is  also  affected  by  the  fire 
control  and  resulting  aiming  errors  of  the  tank  gun,  which  depend  on  whether 
the  tank  comes  to  a  stop  or  fires  on  the  move  and  whether  the  tank  determines 
range  visually  or  by  means  of  a  laser  rangefinder.  Permanent  kill  of  the  GLD 
is  accomplished  by  either  destroying  the  laser  designator  or  disabling  both 
the  GLD  operator  and  observer;  temporary  kill  indicates  either  permanent  kill 
or  disabling  only  the  operator. 

Flight  time  of  tank  round.  This  parameter  is  determined  by  the  type 
of  round  fired  and  the  distance  from  the  tank  to  the  GLD.  Since  this  time  is 
nonzero,  the  possibility  exists  in  the  1-on-1  duel  that  both  the  tank  and  the 
GLD  will  be  killed. 
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Coordination  of  tank  platoon*  In  the  case  of  multiple  missiles  fired 
against  a  tank  platoon,  the  degree  to  which  the  tanks  in  the  platoon  can  alert 
their  companion  tanks  when  they  are  being  designated  may  be  a  major  factor  in 
the  outcome  of  the  counterfire  duel* 

Time  between  missile  fires*  The  time  interval  between  successive 
laser-guided  missile  launches  must  be  coordinated  with  the  GLD  operator  and  is 
dependent  on  the  speed  with  which  he  can  evaluate  missile  hit  or  miss  and 
switch  to  the  next  target-  It  is  expected  that  two  or  three  missiles  may  be 
in  flight  simultaneously  against  a  tank  platoon. 

To  analyze  the  missile  versus  tank  duel,  the  following  exchange  ratio 
is  an  appropriate  performance  measure: 

=  Expected  number  of  tanks  killed _ 

Probability  of  GLD  kill  by  tank  counterfire 

In  the  one  missile  versus  one  tank  duel,  this  exchange  ratio  may  be  simplified 
to 


ER  -  _  -  Probability  of  tank  kill  ___  ___  __ 

Probability  of  GLD  kill  by  tank  counterfire 

This  performance  measure  provides  a  conparative  description  of  the  outcome  of 
the  missile  versus  tank  encounter;  large  values  of  ER  are  favorable  to  the 
missile/GLD  system,  small  values  favorable  to  the  tank.  Note  that  a  direct 
cost-effectiveness  comparison  would  be  difficult  to  formulate  in  this 
analysis,  because  the  GLD  bears  a  much  greater  significance  than  its  actual 
unit  cost  since  it  is  an  essential  part  of  an  expensive  weapon  system. 

IV*  TANK  RESPONSE  TIME  DISTRIBUTIONS*  Various  field  tests  have  been 
conducted  over  the  years  to  measure  the  rapidity  with  which  a  tank  crew  can 
recognize  and  fire  upon  a  target  which  suddenly  threatens  the  tank. 
Representative  of  the  data  chosen  to  portray  these  tank  counterfire  response 
times  are  the  histograms  in  Figure  2,  which  depicts  times  for  the  tank  to  fire 
its  first  round  (left-hand  histogram)  and  times  between  fires  (right-hand 
histogram) • 


First-Round  Histogram 

64  Data  Points 
Mean  m  19,3  9 
Std.  Dev  -  8.1  8 


T1ME  T0  FIRST  FIRE  111 


Between  Rounds  Histogram 


Figure  2.  Typical  Tank  Response  Time  Data 
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In  the  past,  log  normal  distributions  were  selected  to  fit  such  tank 
response  time  data.  However,  since  the  objective  is  to  develop  a  tank 
counterfire  model  which  is  as  realistic  as  possible,  the  tank  must  be 
permitted  any  number  of  counterfires  during  the  designator-on  period.  That 
is,  the  number  of  rounds  the  tank  can  fire  at  the  GLD  will  be  limited  only  by 
the  time  in  which  the  tank  has  to  fire,  and  not  by  any  arbitrary  limit  imposed 
to  simplify  the  derivation  of  the  relevant  mathematical  formulas.  We  shall 
see  that  this  flexibility  is  not  in  consonance  with  the  use  of  lognormal  fits 
to  the  histograms. 

We  introduce  the  notation 

t  -  time  to  nth  tank  fire, 
n 

At  =  time  between  tank  fires* 


Suppose  a  suitable  probability  density  function  f(t,j)  has  been  fit  to  the 
time-to-first-fire  histogram  and  a  density  function  g(At)  to  the  time-between- 
fires  histogram.  Under  the  assumption  that  the  time-from-f irst-f ire-to- 
second-fire  distribution  also  represents  that  between  any  two  successive 
fires,  the  time  to  the  (k+1)st  fire,  for  any  k  ^  1 ,  may  be  written 


&k+1 


=*  t-  +  At  +  .  •  .  +  At 


- v - 

k  independent  choices  of  At 


Since  addition  of  random  variables  corresponds  to  convolution  of  their 
probability  density  functions,  the  density  function  for  t^+1  is  then  given  by 
the  k-fold  convolution 


f(tk+1)  =  f(t,)  *  g(At)  *  ...  *  g(At)  (3) 

v - - v-  — 

k  times 


Each  of  these  convolutions  requires  an  integration,  and  so  the  problem  appears 
to  become  computationally  complicated  beyond  about  4  shots.  The  situation 
would  be  tractable  if  the  distribution  of  At  were  reproductive  (a  distribution 
is  defined  to  be  reproductive  if  the  sum  of  independent  random  variables  each 
with  such  a  distribution  is  a  random  variable  which  again  has  such  a 
distribution).  The  normal  distribution  is  a  well-known  reproductive 
distribution?  however,  this  distribution  does  not  have  the  proper  shape  to  fit 
typical  tank  response-time  histograms. 

Finding  a  reproductive  distribution  to  fit  the  time-between-f ires 
histogram  is  crucial  to  the  development  of  the  present  model;  suitable 
candidate  distributions  are  examined  in  table  1.  The  moment-generating 
functions  are  listed  because  the  usual  proof  of  reproductivity  proceeds  by 
showing  that  the  moment-generating  function  for  the  sum  of  two  independent 
random  variables  with  the  same  type  of  distribution,  which  is  the  product  of 
their  two  moment-generating  functions,  again  corresponds  to  a  distribution  of 
that  same  type.  In  fact  it  turns  out  that  the  gamma,  lognormal,  and  Rayleigh 
distributions  all  fit  time-between-f ires  data  reasonably  well,  but  the 
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TABLE  f  .  CANDIDATE  Tf ME-BETWEEN-F I  RES  DISTRIBUTIONS 
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lognormal  and  Rayleigh  distributions  are  not  reproductive*  The  gamma 
distribution  is  not  in  general  reproductive,  but  this  distribution  is 
reproductive  in  the  special  case  of  repeated  addition  of  a  random  variable  to 
itself.  And  this  is  exactly  the  case  here,  where  various  values  of  At  are 
summed* 

Gamma  distributions,  fit  to  the  tank  response  time  data,  then  enable  us  to 
determine  the  density  functions  for  tk+1,  for  k  >_  1,  with  a  single 
convolution. 

With  the  notation 

f00  = 

9(it)  *  fX,n(At) 

to  denote  the  density  functions  of  such  gamma  distributions,  we  can  express 
the  density  functions  for  tk+1  as 

f(tk+1J  -  f(t,)  *  g(At)  *  ...  *  g(At) 

W - v - J 

k  times 


^1^1  (fc0  *  fX,kh(kAt) 


because  of  the  partial  reproductivity  of  the  gamma  distribution.  Therefore, 
we  have  the  formula 


f  tk+i 


tljdti 


(4) 


for  each  k  >_  1 . 

The  parameters  X  and  n  of  the  gamma  distribution  fits  to  the  response  time 
histograms  are  obtained  by  equating  nA  to  the  test  data  mean  and  nA  to  the 
test  data  variance*  In  this  way  the  following  gamma  distribution  fits  are 
derived: 


( 1 )  t ime-t o-f irst-f ire 
X1  -■  0.269 
T11  «  5.262 

f(t,)  =  (2.783  x  10-5,  t4.262exp(_0>269t^ 

This  fit  passes  the  chi-square  test,  since  a  value  of 
X2  =  4.33 
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is  calculated  with  3  degrees  of  freedom,  which  conpares  favorably  with  the 
critical  value  at  the  5-percent  level  (type  I  error) 

V2 

X0  =*  7.815 

(2)  time-between-f ires 
A  =  0,3273 
T1  =  5.204 


g(At)  =  (9.123  x  1Q~5)At4*2°4exp(-0  .3273At) 


This  fit  also  passes  the  chi-square  test,  at  least  at  the  2-percent  level. 


These  gamma  distribution  fits  are  shown  in  figures  3  and  4. 


DATA  POINTS  -  64 


TIME  TO  FIRST  FIRE  (s) 

Figure  3.  Gamma  distribution  fit  to  first-round  histogram. 


DATA  POINTS  -  72 


Figure  4.  Gamma  distribution  fit  to  between-rounds  histogram. 


Substitution  of  the  gamma  distribution  parameters  into  equation  (4)  then 
gives  the  formulas  for  the  time-to-nth-fire  probability  densities: 


f(t  )  =  (2.783  x  10_5)t^'262exp(-0.269t1> 


f(tk+1)  -  (2.783  x  10-5)  (0.00299)kexp(-0.3273t,c+1)  • 

T( 5.204k)  e!*<0-0583V  dt1 


,\+1  4.282^-  tl,5.204k-1 

0  1 


for  k  >_  1 

Sraphs  of  some  of  these  time-to-nth-f ire  probability  density  functions  are 
presented  in  figures  5  through  9. 


Figure  5  .  Probability  density 

for  time- to- first- fire 


Figure  6  .  Probability  density 

for  time-to-second-fire 


TtME-to-3rd*FlRE,  t,  (9) 


Figure  7,  Probability 

density  for  time-to-third- 


Figure  8  .  Probability 

density  for  time-to- 
fourth-f ire 


Figure  9  ,  Probability 

density  for  time-to-fifth- 
fire 


V.  TANK  COUNTERFIRE  MODELS-  With  the  tank  response  time  distributions 
derived  in  the  previous  section,  analytic  models  can  now  be  developed  for  the 
1-on-1  duel  and  the  3-on-3  encounter*  Formulas  will  be  given  expressing  the 
probability  of  GLD  destruction  and  the  expected  number  of  tanks  killed  in  each 
encounter.  Any  number  of  tank  counterfires  may  occur  during  the  designator-on 
time  and  the  flight  time  of  the  tank  round  is  explicitly  considered. 
Coordination  of  counterfire  from  the  tank  platoon  is  also  treated. 

Several  ground  rules  are  established  in  developing  the  model,  although 
different  assumptions  could  be  readily  incorporated.  Line-of-sight  between 
the  GLD  and  the  tank  is  maintained  throughout  the  duel  encounter?  this  is  not 
unrealistic,  especially  when  the  tank  or  tanks  stop  to  fire.  If  the  tank 
laser  alarm  system  is  operating  properly,  it  is  assumed  to  detect  the 
existence  of  the  laser  spot  as  soon  as  the  tank  is  illuminated  (if  there  were 
a  known  lag  time,  it  would  be  subtracted  from  the  designator-on  time).  Once 
the  laser  alarm  sounds,  the  tank  crew  is  assumed  to  devote  full  attention  to 
defeating  the  GLD  rather  than  pursuing  its  original  mission.  If  the  tank 
destroys  the  GLD,  the  laser-guided  round  is  rendered  harmless  to  the  tank. 
Suppression  is  not  played,  so  tank  rounds  which  miss  the  GLD  do  not  disturb 
the  GLD  operator* s  illumination  of  the  tank  target  and  hence  do  not  disrupt 
the  operation  of  the  laser-guided  weapon  system.  Tanks  may  fire  either  at  a 
stop  or  on  the  move,  and  they  may  adjust  their  aim  between  rounds;  these 
possibilities  can  be  handled  by  using  the  appropriate  SSKP  data.  The 
assumption  that  the  tanks  fire  at  the  GLD  only  during  the  designator-on  time 
serves  as  the  end-of-game  criterion. 
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One-on-One  Duel.  The  duel  between  a  single  laser-guided  missile 
directed  against  an  individual  tank  by  a  GLD,  with  that  tank  counterfiring 
against  the  GLD,  will  be  treated  first.  The  following  notation  will  be  needed 
to  express  the  resulting  probability  formulas. 

T1  -  designator-on  time  =  time  from  laser  alarm 
to  missile  hit. 

T2  =  flight  time  of  tank-fired  round 
I>D  =  probability  that  the  tank  detects  the  laser 
designation  and  fires  at  the  GLD 
PTK  “  SSKP  by  laser-guided  missile 
PGK  -  SSKP  by  tank  round 

Pn  -  probability  that  the  tank  fires  the  n^  round  in 
seconds 

=  Prob  (tn  <.  T1 } 

p*  =  probability  that  the  tank  fires  the  nfc  round  in 

11  T1  "  t2  secon<is 

=  Prob  (tn  <_  T1  -  T2) 

Since  the  tank  round  is  unguided  after  fire,  both  the  tank  and  the  GLD  could 

be  destroyed  if  the  tank  fires  a  round  at  the  GLD  less  than  T2  seconds  before 

missile  hit.  It  is  for  this  reason  that  the  distinction  between  P„  and  P*  is 

n  n 

drawn. 

The  formulas  which  govern  the  1-on-1  duel  may  be  written  as  follows: 


P  «  Prob  (tank  killed  by  missile) 

=  %['  -  V*  *  Pd  E  -  'S+lH1  -  *«)“] 

n=1 

Q  =  Prob  (GLD  killed  by  tank) 

GO 

*  V®  L  M1  * 

n*l 

P 


(7) 


(8) 


(9) 


Formula  ( 7)  for  tank  kill  is  derived  by  expressing  the  event  that  the  tank  is 
killed  by  the  laser-guided  missile  as  a  sum  of  disjoint  events,  where  the  nth 
term  in  the  summation  corresponds  to  the  event  in  which  the  tank  has  fired 
exactly  n  rounds  before  being  destroyed  by  the  missile  but  all  miss. 

Likewise,  formula  (S)  for  GLD  kill  is  constructed  from  a  sum  of  disjoint 
events,  with  the  ntl1  term  in  the  summation  representing  the  event  in  which  the 
tank  fires  n  rounds  before  missile  hit  and  the  n^  round  is  the  one  that 
destroys  the  GLD.  The  exchange  ratio  ER  is  then  given  by  the  quotient  of 
these  two  probabilities. 
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Three  Missiles  versus  Tank  Platoon*  Derivation  of  the  probability 
formulas  is  considerably  more  complex  for  the  counterfire  scenario  of  three 
laser-guided  missiles  rapid-fired  under  the  control  of  one  GLD  against  a 
platoon  of  three  tanks*  In  this  case  the  GLD  switches  to  another  tank  upon 
tank  kill  and  two  or  three  missiles  may  be  in  the  air  at  one  time  in  an  attack 
sequence*  An  important  factor  in  the  analysis  of  this  scenario  is  the  degree 
to  which  the  tanks  can  alert  their  con^anion  tanks  to  the  presence  of  the 
laser  designator*  Two  cases  will  be  considered;  an  uncoordinated  platoon, 
and  a  perfectly  coordinated  platoon.  The  following  notation,  which  extends 
that  used  for  the  one-on-one  duel,  will  be  needed  in  developing  the  formulas 
(T  denotes  a  time  variable) ; 

Pn(T)  “  probability  of  nth  tank  fire  in  T  seconds 

♦  th 

P^(T)  =  probability  of  ntn  tank  fire  in  T  -  T2  seconds 


p* (t)  -  probability  that  all  tank  rounds  fired  in  T  -  T2 
seconds  miss 

=  1  -  p/(T)  +  PD  £  (P* (T)  -  P*+1CT>)(1  -  PGK)n 

1  n=l 

P (T)  =  probability  that  all  tank  rounds  fired  in  T  seconds 
miss  00  n 

=  1  -  p^  (T)  h-  pd  £  (Pn(T)  -  pb+1(t>)  (1  -  PGK) 

n=l 

Q(T)  =  probability  that  the  tank  fires  a  GLD  -killing  round 
in  T  seconds 

=  £)  P  (T)  (1  -  P_„')n‘1 

D  GK  ,  n  V  GK/ 
n=l 

Q*(T)  =  probability  that  the  tank  fires  a  GLD  -killing  round 
in  T  -  T2  seconds 

=  PDPGK  ^  Pn(T)  (X  "  PGk) 
n=i 

T3  =  length  of  time  that  GLD  illuminates  companion  tanks 
=  time  between  missile  arrivals 
T5  =  companion  tank  reaction  time  delay* 
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p(i)  =  probability  that  i  tanks  are  killed 

Q  -  probability  that  GLD  is  killed 

Q{i*  =  probability  that  GLD  is  killed  and  i  tanks  are 
killed 

It  should  be  clear  that 

p(i)  -  -  probability  that  GLD  survives  and  i  tanks  are 

killed, 

and  that  if  there  are  m  target  tanks  (e.g.,  m  =  3  for  a  tank  platoon), 
then 

m 

Vi  p(i)  =1  , 

i-0 

m 

£  2(i)  -  2  - 

i=0 

m  ( i ) 

E  =  expected  number  of  tanks  killed  =  ^  iP  • 

i-0 


T1  will  now  be  interpreted  as  the  time  from  laser  alarm  of  the  first  tank 
designated  to  the  arrival  of  the  first  missile,  to  be  consistent  with  the 
usage  of  the  one-on-one  duel.  It  is  easily  shown  that  for  any  value  of  T 


P(T)  +  Q(T)  =  1 
P*(T)  +  Q*(T)  =1  . 


The  convention  is  adopted  that  P*(T)  -  1  and  Q*(T)  =  0  when  T  <  T2. 

Uncoordinated  Tank  Platoon.  First,  suppose  the  designated  tanks  do 
not,  or  are  unable  to,  alert  companion  tanks  in  the  platoon  to  the  presence  of 
the  GLD.  In  this  case,  the  tanks  counterfire  only  upon  being  designated 
themselves,  so  the  logical  GLD  strategy  is  to  beam  on  a  tank  until  it  is 
killed,  then  switch  to  the  next  tank  and  designate  it  for  T^  seconds  before 
the  impact  of  the  next  missile.  The  time,  T^,  between  missile  fires  has  to 
allow  for  this  switching  time.  If  a  missile  misses  its  intended  target,  the 
GLD  still  maintains  its  designation  of  the  target  tank  instead  of  switching  to 
another  tank  at  this  point,  because  switching  would  incur  counterfire  from 
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another  tank  in  addition  to  the  continued  counterfire  from  the  tank  originally 
designated.  The  game  ends  when  the  last  missile  reaches  its  target?  the  GLD 
switches  off  and  no  further  tank  rounds  are  fired  (however,  the  effect  of  a 
tank  round  already  in  flight  is  included). 

The  equations  governing  this  situation  are  presented  in  figure  10.  The 
assumption  is  made  that  the  time  between  missile  arrivals  is  greater  than  the 
flight  time  of  the  tank  round;  that  is 


This  assumption  is  valid  in  most  tank  counterfire  cases.  Modifications  have 
to  be  made  to  the  formulas  if  this  inequality  is  reversed?  such  formulas  are 
included  in  another  report3. 


=  Prob  (3  tanks  killed) 

-  P(Ti)P(Tj)P*(Tj)P^k 

P^  =  Prob  (exactly  2  tanks  killed) 

-  P(T,)([q(T3)  -  Q*(T3)]  ♦  P(Tj)[l  -/*<T3>Ptk])p?k 
+  p(t,)p*(t3  +  T^Hl  -  Ptk)Ptk 

+  P(Tj  +  Tj^PifTjMl  -  pTK)pTK 
=  Prob  (exactly  one  tank  killed) 

*  [<1<T1>  -  n 
+  P(T1)(q*(T3)  +  [q*(T3  +  T4)  -  Q*(T3)]0  -  PTK)  +  p*(T3  +  V°  '  V  /  TK 

+  [KT,  +Tu)  -  d*(T,  ♦  V](l  -  PTK)PTK 

*  P(T,  +  T4)j^l  -  P*(T3)Ptk](1  -  PTK)PTK 

+  P*(T,  +  2T4)(1  -  pTK)2pTK 


Figure  10.  Formulas:  Three  missiles  versus  uncoordinated  tank  platoon. 


a  -■  —  .  .  ■ 

Joseph  V.  Michalowicz,  Analysis  of  the  Laser^uided  Maverick  versus  Tank 
Counterfire  Duel,  Harry  Diamond  Laboratories,  HDL-TR-1909  (June  1980). 
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I  Prob  (no  tanks  killed) 


=  Q*(T1) 

+  [q*(T,  +  Tj,)  -  a*(T,)](i  -  ptk) 

+  [q*(T,  +  2T4)  -  Q*(T,  +  V](>  -  PTK)2 

+  p*(T,  +  2T4)  (1  -  PTK)3 

Q  -  Prob  (GLD  killed) 

=  P(T,)P(T3)a(T3)PTK 

+  P(T1)(q(T3)  +  [®(T3  +  Tl,)  "  "  Ptk^)PTK 

+  p(t,  +  T4)a(T3)o  -  pTK)pTK  2 

+  Q(Tj)  ♦  [d(T,  +TA)  -  Q(T,)]0  *  PTK)  +  [<KT)  +  2V  ’  Q(T1  +  V](I  ‘  V 

q(3)  =  P(TijRnt»(T3)PW^0(T3)  -  Q*(T3jjPTK 

q(2)  «  P[TijPTlCP(T3jPTK  +  (1  -  PXKi[Q(T3)  -  Q*(T3jj)  +  P(Ti]PTKeTK[Q(T3]  *  Q*(T3J] 

+  p(Ti)Ptk(1  '  PtkJ*’tk[Q(T3  +  T4j  -  Q*(T3  +  T4 J J 
+  (1  -  PTic)p(T1  +  TfjPTKpTKlQfTa)  -  Q*(T3jJ 
G(1)  "  PTK  *  Q*(Tl) J  +  P(TijQ*(TaJ  +  P(TiJ(1  -  Ptk)1Q*(t3  +  T4J  -  Q*  ( T  3  J  J 

+  P(Tl)(1  -  PTk)2IQ(t3  +  -  »*(T3  *  T*)j) 

+  (1  "  PTk)PTK  (L0(t1  +  T4j  -  0*(T!  +  T4jj  +  P(T!  +  T4jQ*(T3) 

+  P(Tl  ♦  T4J(1  -  PTK)[Q(T3j  -  Q*  CT3)  j) 

+  [1  -  Ptk)2PtkL«(TI  +  2T4j  -  Q*(T1  +  2T\)j 

g(0)  =  Q*^Tij  +  (1  -  PtkJ[Q*[T1  +  T^J  ‘  Q*[TlJ  J  +  (1  “  PTK) 2  [Q*  (T 1  +  »«.)  -  Q*(T!  +  t4Jj 
+  (1  -  PTJtJ3[Q(Tl  +  »•»]  -  Q*fTl  -  2T4  J  J 

E  =  Expected  number  of  tahks  killed 

.  3P(3)  +  2P(«  +  pf) 

E 

Exchange  ratio  =  ^ 

Figure  10.  Formulas:  Three  missiles  versus  uncoordinated  tank  platoon  (cont’d) 
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Each  of  the  formulas  in  Figure  10  is  derived  from  a  a  sum  of  disjoint 
events  as  for  the  1 -on-1  duel,  but  it  is  clear  that  the  complexity  of  the 
formulas  has  greatly  increased  in  the  multiple  missile/multiple  tanks  case. 


We  choose  one  example,  say  the  formula  for  P^1^,  to  show  how  these  equations 
are  derived.  Now  P'  '  is  the  probability  that  exactly  one  tank  is  killed;  the 
disjoint  events  and  their  corresponding  probabilities,  which  add  up  to  give 
the  formula  for  P^1',  are  shown  in  Figure  11. 


Event  Probability 

1st  missile  kills  1st  tank  but  an  in-flight  PTK^T1 )  *T1 )  ] 

round  from  the  1st  tank  kills  the  GLD. 

1st  missile  kills  1st  tank  and  either  the  GLD  P(T1  jP^tQ*  (T^ )  + 

is  killed  by  2nd  tank  before  2nd  missile 

arrives  or  before  the  3rd  missile  if  the  (1“PTK^  ^  (T3+T4)-Q  (Tj)] 

2nd  missile  misses,  or  both  the  2nd 

^  A 

and  3rd  missiles  miss.  +  P  (T3+T4)  (1-P^)^) 


1st  missile  misses  but  2nd  missile  kills 
1st  tank  and  an  in-flight  round  from 
the  1st  tank  kills  the  GLD. 

1st  missile  misses  but  2nd  missile  kills 
1st  tank  and  either  2nd  tank  counterfire 
kills  the  GLD  or  the  3rd  missile  misses. 


(1— ptk)ptk[q(t1+t4)  -  q*(t1+t4)] 
p(t1+t4hi-ptk)ptk  [q*(t3) 

+  P*(T3)  (1-P^)] 


1st  and  2nd  missiles  miss  but  the  3rd 
missile  kills  the  1st  tank 


P*(T1+2T4)(1-Ptk)2Ptk 


Figure  11. 


Disjoint  events  used  to  derive  formula  for  P 


(1) 
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Coordinated  Tank  Platoon,  On  the  other  hand,  suppose  that  the  tank 
platoon  is  coordinated  in  the  sense  that  upon  laser  alarm  the  designated  tank 
alerts  its  conf>anion  tanks  and  they  begin  counterfire  as  well,  after  a  time 
delay,  T^,  required  either  to  receive  the  information  from  the  designated  tank 
or  to  observe  its  reaction  to  recognition  of  the  laser  illumination  (e.g. , 
stopping,  slewing  of  the  turret).  Since  in  practice  T-j  >  T5,  all  tanks  will 
have  begun  counterfiring  at  the  GIiD  before  the  arrival  of  the  first  missile. 
The  GLD  is  assumed  to  use  the  same  designation  strategy  as  in  the  case  of  the 
uncoordinated  tank  platoon,  since  it  is  of  no  benefit  to  switch  targets  until 
the  kill  is  observed.  The  accuracy,  and  hence  the  SSKP,  of  the  companion  tank 
rounds  will  be  assumed  to  be  the  same  as  the  designated  tank. 

In  figure  12  the  equations  are  presented  for  this  case;  derivations  are 
again  based  on  sums  of  disjoint  events.  The  assumption  that  T^  T2  remains 
in  effect  for  these  formulas: 


p<3> 


=*  Prob  (3  tanks  killed) 

=  P(T1)P(T]  +  T4  -  TS)P*(T1 


+  2T. 


-vp 


3 

TK 


-  Prob  (exactly  2  tanks  killed) 

=  P(T,)[Q(T1  +  *  T5)  -  Q*(T]  +V-  V]P*(T1  +  T*  "  T5)PTK 

+  P(T1)P(T1  +  T4  -  Tj)  [Qa ( T i  +  2T4  "  T5)  '  Q*(T!  +  TA  "  T5^1  PTK 
+  P(T1)P(T]  +  -  T5)PMT1  +  2T4  -  T5)P2TKd  ~  PTK) 

+  P(T,)  [P*(T,  +  2T4  -  T5)J2P2k(1  -  PTK) 

+  P (Tj  +  T4)[p*(T,  +  2T4  "  T5^]2^  "  PTK^PTK 


=  Prob  (exactly  one  tank  killed) 

-  [Q < T | )  -  (J*(T,)][P*(T,  -  T5)]2Ptk 

+  P<T1 ‘  T5)]2  "  [P*(T1  +  T|<  '  V]  )PTK 

+  P(T,)  ([pA<T,  +  T4  '  V]2  '  [P*(T1  +  2T4  "  T5)]2)PTK°  ‘  PTK) 

2  2 

+  P(T,)[P*(T,  +  2T4  -  T5>]  Ptk0  -  PTK) 

+  j^Qd,  +  T„)  -  «*(T,  +  V][P*(TI  +  T4  "  T5)]2<1  ‘  PTK>PTK 
+  P(T,  +  V([P*d,  +  \  -  Tj)]2  -  [P*(T,  +  2T4  -  T5  )]‘)(1  '  PTK)PTK 
*  p(T,  +  Tt)[P*(T,  +  2T4  -  T5)]20  V2pTK 
+  P*(T1  +  2TI))^P*(T)  +  -  3"^)J2(I  -  pTK>2pTK 


Figure  12.  Formulas:  three  missiles  versus  coordinated  tank  platoon. 
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P^  «  Prob  (no  tanks  killed) 

-  1  -  PMT^PMT,  -  Tj)]2 

*  (»*<!,)  [P*(T,  -  T5)]2  -  P*(T,  +  T4)[P*(T,  ♦  T4  -  T5)]2)(l  -  PT|<) 

+  (P*(T1  *  V[P*(T1  +  t4  -  T5>]2  -  p*<Ti  +  2V[P*'(T1  -  2t4  -  V]2)0  ’  V2 

+  ?*(Ty  *  2V[P*(T1  +  2Tk  '  V]Z(I  '  PTK)3 

Q  ■  Prob  (GLD  killed) 

■  1  *  Prob  (GLD  not  killed) 

-  i  -  (popper,  +  t4  -  t5)p(t,  +  2t4  -  t5)p2k 
+  P(T,)[P<T,  ♦  2T4  -  T5)J2ptkO  -  PTK> 

+  P(T,  +  Ti,)[plTi  2Tk  *  T5)]  C  *  ’W’tk 
♦  P(T,  +  avfpcr,  *  ZT4  -  ts)]2(i  -  ptk)2) 

q(3)  .  L JP<pjqP(Ti  t  TU  -  t5)PTK[Q(fl  ♦  »>*  -  *5j  *  Q*(Ti  +  2T<*  -  TsjjP^ 
q«2>  ,  PfTlJP^Ptu  (|Q(Ti  +  T4  -  T5j  -  g*(Tt  +  T4  -  T5)JP*(Ti  +  Ti*  “  T*,] 

P(Tj  *  T 4  -  Ts)[fi*(Tl  *  2T4  -  t5J  -  0*(T!  +  Ti,  -  Tsjj 

+  P(T1  ♦  Th  -  r5)(!  -  Pth)[0(ti  *  'XT*  -  r5j  -  +  zrH  -  t5j|^ 

*  nTlJPTK(1  *  pTxJPTK:  ([**(*1  +  »4  '  *5jj*  -  ;*(T!  ♦  »„  -  TSjj2) 

*  t1  *  *Tk)*(*1  +  T‘0PTKPTK  ([p*(*l  +'  rrH  T5)|2  -  [9{*l  ♦  2Th  -  Tsjj2) 

QM>  -  Ptr  j\c(nj  -  Q*fTi)  j[p*(tl  -  T5JJ2  ♦  P(T|)  ^[P*(TL  -  Tsjj2  -  [P-(Tt  +  T4  ■*  Tsjj2) 

+  P(Ti)[t  -  P^j  ^[P*(T|  ♦  T4  -  TSjJ2  -  [P*(Tl  -  2T4  -  TSjj2j 

♦  P(Ti)(1  -  PtkJ^  (lp*(Tl  ♦  2T*  -  Tsjj2  -  [V{Xx  *  2T4  -  T5)J2j] 

+  (T  -  pTk)pTK  [[C(t1  +  *<•)  -  0*(Ti  *  T“JJlP*(Tl  +  T-  -  t5)J 2 

i-  P(Ti  *  T4]  ^[P*(T!  ♦  T|*  -  Tsjj2  -  [P*(Tt  +  2T4  -  TSj 

+  p(*l  *  T4J(1  -  Ptk)  ^P*(T1  *  2T4  *  T*)J2  -  [P(TV  *  2T4  -  TS)j^J 

♦  (1  -  pTK)2pTK  +  2T4j^P*(Ti  +  '  TS)j2  -  p(Tj  «■  2T4][»(Tl  ♦  2T4  -  T3jj2j 

q<0>  .  1  -  P*(Ti  J  [P*(Tj,  -  T5jj2  *■  (1  -  PtkJ  ^P*(Tl)^P*(Tl  -  Tf5jj2~  p*(Tl  +  t4)[P*[Ti  ♦  T*  -  T5jj2) 

+  (1  -  Ptr)2  (P*(Tl  +  T4J[P*(iTi  ♦  T4  -  TSjj2  -  P-(Ti  f  2T4j[P*(Ti  ♦  2tf4  -  T5)J2) 

+  (1  -  Piyjtj3  (P*(»l  ♦  2T4)[P*[Tl  ♦  »4  -  TS)J2  -  P(Tl  *  »4)l»(*X  +  »•*  ~  TS)  J2) 

E  -  Expected  number  of  tanks  killed 

*  3 +  2?^  + 

Exchange  ratio  =»  ^• 

Figure  12*  Formulas:  three  missiles  versus  coordinated  tank  platoon. (cont1 d) 
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The  model  has  been  developed  thus  far  under  the  assumption  that  the  tank 
round  SSKP,  PGK,  remains  constant  for  each  fire-  However,  the  tanker  may  be 
able  to  adjust  his  aim  based  on  the  result  of  his  first  fire  and  thus  improve 
the  SSKP  for  succeeding  fires,  or  he  may  load  a  different  round  for  subsequent 
fires  from  the  one  he  was  originally  carrying  in  his  main  gun,  or  he  may 
estimate  the  range  to  the  GLD  either  visually  or  with  a  laser  rangefinder  and 
thus  increase  the  accuracy  and  lethality  of  the  rounds  fired  at  the  e^qpense  of 
a  small  time  delay  for  ranging-  All  of  these  sophistications  can  be  added  to 
the  above  formulas  with  little  difficulty. 

VI-  SAMPLE  RESULTS  AND  CONCLUSIONS-  The  tank  counterfire  duel  models 
which  have  been  presented  were  exercised  for  many  combinations  of  parameters 
in  order  to  provide  the  encounter  outcomes  for  the  SSTF  force-on-force 
analysis-  One  way  to  graphically  display  the  results  of  a  counterfire  duel 
is  demonstrated  in  Figure  13  which  is  based  upon  a  sample  set  of  input 
parameters.  The  graph  shows  the  expected  number  of  tanks  killed  (solid 
curves)  and  the  probability  of  GLD  kill  (dashed  curves)  in  a  3-on-3  encounter 
over  a  range  of  tank— to^GLD  distances,  with  the  GLD  located  in  either  a 
foxhole,  a  Forward  observer  Vehicle  or  a  bunker- 


TANK-TQ-GLD  RANGE  (km) 

Figure  13.  Sample  Counterfire  Duel  Results 

These  same  results  are  represented  in  tabular  form,  for  a  particular 
intermediate  tank-tOH3LD  range,  in  Table  2- 
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Table  2*  Sample  Tabular  Counterfire  Duel  Results 


Tank- 

to-GLD 

range 

(km) 

GLD 

Position 

p<°> 

p(1> 

P(2) 

p<3) 

E 

Q(°) 

Q(1) 

g(2) 

q(3) 

Q 

ER 

1.5 

Foxhole 

0.170 

0.409 

0  ..33  1 

0.090 

1.341 

0.002 

0.002 

0.000 

0.000 

0.004 

335 

1.5 

Bunker 

0.203 

0.405 

0.308 

0.0  84 

1.2  73 

0.058 

0.043 

0.007 

0.000 

0.108 

12 

1.5 

FOV 

0.176 

0.409 

0.327 

0.088 

1  .327 

0.014 

0.010 

0.002 

0.000 

0.026 

51 

The  exchange  ratios  obtained  in  this  sample  calculation  indicate  that  the 
foxhole  position  is  the  most  survivable  and  effective  GLD  position,  while  the 
bunker  is  the  least  survivable  to  tank  counterfire.  No  actual  conclusions 
should  be  drawn  from  this  san^le  calculation  since  these  results  are  highly 
dependent  on  the  choice  of  input  parameters,  such  as  the  type  of  tank  rounds 
fired,  but  this  same  line  of  reasoning  would  be  used  with  actual  data  to  draw 
survivability  conclusions. 

Incorporating  the  counterfire  duel  methodology  into  the  SSTF  war  game 
scenario  provided  the  data  needed  to  determine  whether  or  not  the  GLD  is 
survivable  on  the  battlefield.  Due  to  classification,  the  conclusion  of  the 
SSTF  analysis  cannot  be  presented  in  this  paper.  However,  it  should  be 
mentioned  that  the  counterfire  duel  methodology  was  also  very  useful  in 
testing  the  sensitivity  of  the  SSTF  conclusions  to  variations  in  the  many 
input  parameters  required  in  the  study  —  such  as  type  of  tank  rounds  fired, 
motion  of  tank,  handoff  time  between  tanks,  designation  mode,  slope  at  GLD 
position,  and  laser-guided  weapon  rate  of  fire  —  to  determine  which  were 
critical*  Such  sensitivity  analyses  are  often  instrumental  in  the  development 
of  weapon  system  improvements. 

It  is  anticipated  that  the  analytic  model  presented  in  this  paper  is 
sufficiently  general  to  have  applications  to  many  types  of  duel  encounters 
where  response  time  is  the  critical  factor.  We  hope  that  the  reader  will  find 
the  model  useful  in  such  situations;  further  details  can  be  obtained  from  the 
author. 
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Abstract 

Battlefield  units  adjacent  to  a  targeted  unit  must  maintain  some 
separation  distance  to  avoid  collateral  damage*  A  tabular  algorithm  for 
determining  such  separation  distances  based  on  unit  damage  criteria  and  weapon 
delivery  errors  for  several  confidence  levels  is  presented.  The  algorithm 
depends  on  a  numerical  technique  for  integrating  a  two-dimensional  weapon 
burst  distribution  function  over  some  base  region  and  an  iterative  technique 
to  obtain  separation  distances  given  other  known  parametes,  Hie  use  of  these 
numerical  techniques  is  discussed  along  with  several  current  applications  of 
the  algorithm, 

1,  INTRODUCTION 

On  the  tactical  nuclear  battlefield,  if  the  enemy  (Red)  is  able  to 
accurately  determine  the  location  of  a  high-priority,  friendly  (Blue)  unit, 
he  is  expected  to  fire  a  nuclear  round  of  sufficient  yield  and  accuracy 
to  destroy  the  target  with  a  high  degree  of  confidence.  That  unit  is 
effectively  lost,  so  the  in5>ortant  question  concerning  Blue  survivability 
is  the  "bonus  damage"  produced  by  that  nuclear  fire  on  adjacent,  nontargeted 
units, 

A  typical  problem  is  the  calculation  of  the  desired  minimum  separation 
distance  between  "neighbor"  units.  For  a  given  confidence  level,  C,  this 
distance  is  determined  as  that  at  which  some  specified  environment  or 
environments,  due  to  a  nuclear  burst  at  the  target  unit,  is  exceeded  at  the 
adjacent  neighbor  unit  with  a  probability  of  only  1  -  C,  This  report  presents 
a  handy  tabular  algorithm  to  calculate  these  specific  environmental  criteria 
for  various  confidence  levels. 

Applications  of  the  methodology  are  developed  for  several  examples:  (1) 
command  post  survivability,  (2)  weapon  employment,  and  (3)  trade-off  between 
hardening  and  operational  deployment* 


2*  METHODOLOGY 


Let  us  suppose  that  the  target  unit  is  located  at  ground  zero  (GZ)  and 
that  the  burst  point  (more  precisely,  the  projection  on  the  ground  of  the 
point  at  which  detonation  occurs)  has  a  two-dimensional  normal  distribution 
about  GZ  with  density  function 


pU,y) 


e*(x2+y2)/2a2 


(1) 


627 


where  the  standard  deviation  <3  (of  the  marginal  distributions)  results  from 
both  weapon-delivery  error  and  targets location  error*  Assume  that  the 
adjacent  unit  is  located  at  a  separation  distance  S  from  GZ  as  shown  in  figure 
1.  Ftor  a  given  confidence  level,  C  (e.g. ,  C  ®  0.90),  the  distance  D  is 
calculated  at  which  the  probability  is  only  1  -  C  that  the  burst  point  of  the 
round  falls  within  D  of  the  adjacent  unit,  from  the  following  formula: 


(2) 


Figure  X.  Geometry  of  Units. 


Solving  equation  (2)  for  D  as  a  function  of  C  clearly  involves  a  process 
of  both  integration  and  iteration.  A  "stack  of  disks"  method  is  used  to 
perform  the  double  integration  of  the  integrand,  whose  graph  yields  the 
surface  shown  in  figure  2. 

Z 


-TRUNCATED  DISK 


lira2 


($,0,0) 


Figure  2.  Integration  Surface. 


The  double  integral  is  represented  by  the  volume  of  the  figure  formed  by  the 
intersection  of  the  surface  with  a  cylinder  of  radius  D,  centered  at  the  point 
(S,  0,  0).  This  volume  is  calculated  by  slicing  the  figure  into  di3ks,  each 
of  which  has  a  cross  section  which  is  either  a  circle  or  the  intersection  of 
two  circles,  and  adding  up  the  volume  of  all  of  the  disks.  The  iterative 
process  required  to  solve  for  D  for  a  given  C  is  a  "telescoping"  technique 
akin  to  that  used  for  finding  roots  of  polynomial  equations* 

Fomulas  have  been  derived1  for  calculating  the  following  nuclear 
environments  as  a  function  of  range: 

Initial  Nuclear  Radiation 


Total  dose 

Transient  Radiation  Effects  on  Electronics 
*  *  Neutron  Fluence 

* •  Peak  gamma-dose  rate 


Thermal  Radiation 


•  Radiant  Energy 

'  Maximum  Thermal  Ir radiance 

*W.  E.  Sweeney,  Jr.,  Cyrus  G.  Moazed,  and  John  S.  Wicklund  "Nuclear  Weapons 
Environments  for  Vulnerability  Assessments  to  Support  Tactical  Nuclear  Warfare 
studies  (U),"  Harry  Diamond  Laboratories  TM-77-4  (June  1977).  (CONFIDENTIAL) 
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Blast 


*  Peak  static  overpressure  (Ap) 

*  Peak  dynamic  pressure 

*  Overpressure  impulse 

•  Dynamic  pressure  impulse  (1^) 

•  Vehicle  overturn  (Api  -  K) 

<1 

Low  Altitude  Electromagnetic  Pulse 

•  vertical  electric  field 

*  Radial  electric  field 

•  Azimuthal  magnetic  field 


An  exan£>le  of  these  formulas  will  be  given  in  the  Applications  section.  Once 
the  distance  D  has  been  calculated  as  described  above,  the  desired 
environmental  criterion,  Ec,  can  be  determined  by  selecting  the  appropriate 
formula  to  confute  the  environment  at  range  D.  This  environment,  Ec,  is  then 
exceeded  at  the  adjacent  target  at  most  the  fraction  1-C  of  the  time,  and 
equipment- hardening  decisions  could  then  be  made  based  on  this  criterion. 


3.  RESULTS 


Tables  1  through  11  present  the  values  of  D  corresponding  to  the  following 
choices  of  input  parameters: 


Separation  distance,  S: 
Standard  deviation,  a: 
Confidence  level,  C: 


2 ,  3,  4 ,  *  <  < ,  12  km 

100,  150,  200,  950,  1000  m 

0.99,  0.95,  0.90,  0.75,  0.50 


4.  APPLICATIONS 

4.1  Command  Post  Survivability 

A  subject  of  considerable  current  interest  is  the  specification  of 
the  optimal  command  post  architecture  for  the  integrated  battlefield.  Studies 
have  been  made  of  the  enhancement  of  the  survivability  of  command  posts 
through  dispersion  and  redundancy  measures.  In  the  dispersed  command  post 
structure,  separate  independent  functions  — —  such  as  the  air  support 
operations  center,  the  all-source  intelligence  center,  the  fire  support 
elements,  etc.  —  are  dispersed  into  separate  areas,  so  that  if  one  area 
happens  to  be  destroyed,  the  other  areas  can  continue  to  function.  Although 
there  are  certainly  other  considerations,  such  as  the  extra  communications 
needed  which  may  limit  the  distance  permitted  between  various  elements,  one 


2  John  R.  Bondanella,  "Corps  Command  Post  Architecture  for  the  19  86-1990 
Integrated  Battlefield  -  A  Vulnerability  Analysis,"  thesis,  U.  S.  Army  Command 
and  General  Staff  College,  Ft.  Leavenworth,  Kansas  (June  1980). 
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TABLE  OF  DISTANCES  IN  KM. 

FOR  SEPARATION  DISTANCE  OF  2,0  KM 


_Table 
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Table  2 


633 


TABLE  OF  DISTANCES  IN  KM. 

FOR  SEPARATION  DISTANCE  OF  4.0  KM 


\ 

CONFIDENCE  LEVEL 

SIGMA  | 

I M  )  | 

.99 

.95 

.90 

.75 

.50 

1 

100  | 

3.77 

3.64 

3.87 

3.93 

4.00 

150  | 

3  .66 

3.76 

3.81 

3.90 

4.00 

1 

20  0  | 

1 

3.54 

3.68 

3.75 

3.B7 

4.01 

2  50  | 

3.43 

3.60 

3.69 

3.84 

4.01 

1 

_  300  | 

.  3-32.  . 

3.52 

3.63 

3.01 

4.01 

1 

350  | 

3.21 

3.44 

3.57 

3.78 

4.02 

400  | 

3.10 

3.37 

3.51 

3.75 

4.02 

450  | 

2.99 

3.29 

3.45 

3.73 

4.03 

1  | 

:  500  i 

2.68 

3.22 

3.40 

3.70 

4.03 

550  | 

» 

2.78 

3.14 

3.34 

3.67 

4.04 

t 

600  | 

2.67 

3.07 

3.23 

3.65 

4.05 

1 

650  | 

2.57 

3.00 

3.23 

3.62 

4.06 

700  r 

1 

2.46 

2.93 

3.18 

3.60 

4.06 

750  { 

2.36 

2.66 

.  3.12 

3.57 ....  __ 

4.07 

1 

b  00  | 

2.26 

2.79 

3.07 

3.55 

4.08 

850  | 

j 

2.16 

2.72 

3.02 

3.53 

4.09 

900  j 

2.06 

2.65 

2.97 

3.51  .  „„ 

4.11  .  _  _  _ 

1 

950  | 

1.97 

2.59 

2.92 

3.49 

4.12 

1000  | 

1.67 

2.52 

2.37 

3.47 

4.13 

• 

Table  3 


TABLE 

OF  DISTANCES  IN  KM. 

FOR  SEPARATION  DISTANCE  OF  5,0 

KM. 

l 

CONFIDENCE  LEVEL 

SIGH  A  | 

j  " 

;  I 

IN)  J 

1 

.99 

.  ...95 

.90 

.75 

_  _  .50.. 

'  1 

:L  . 

Too  1 

i 

4. 77 

4  •  64 

4.37 

4.93 

,.i 

i 

150  | 

■ 

.  4.o6 

4-76 . 

4.81  , 

4.90 

_ 5.00 

j 

j 

200  | 

I 

4.54 

4.69 

4.75 

4.97 

5.01 

250  j 

1 

4.43 

4.60 

4.69 

4.64 

5.01 

1 

300  | 

j 

4.32 _ 

4.52 _ 

4.63 _ 

4.91 

_  5.01 

j 

, } 

350  | 

4.21 

4.44 

4.57 

4.76 

5.01 

i 

■400  | 

i 

4.10 

4.36 

4.51 

4.75 

5.02 

450  \ 

3.99 

4.29 

4.45 

4.72 

5.02. 

I 

'i 

c 

1 

500  | 

3.63 

4.21 

4.39 

4.69 

5.03 

a\ 

550  | 

3.77 

4,13 

4.33 

4.66 

5.03 

-P' 

1 

600  ( 

3.66 

4,06 

4.27  . 

.4.64 

5.04 

■  ! 

1 

i 

i 

650  | 

3.55 

3.96 

4.22 

4.61 

5.05 

700  | 

j 

3.45 

3.91 

4.16 

4.58 

5.05 

j' 

i 

750  \ 

3.34 

3.84 

4.11 

4*56 

5.06 

i 

"1 

1 

eoo  1 

. 1 

3.24 

3.76 

4.05 

4.53 

5.07 

B50  ) 

1 

3.13 

3.69 

4.00 

4.51 

5.06 

1 

900  ) 

3.03 

3.62 

3.94 

4.49 

5.09 

i 

:i 

1 

950  1 

2.93 

3.55 

3.09 

4.46 

5.09 

■' 

1000  I 

1 

2.03 

3.43 

3.34 

4.44 

5.10 

i 

t 


Table  4. 


:  635 


TA9L6  OF  DISTANCES  IN  KM. 

FOR  SEPARATION  DISTANCE  OF  6.0  KM. 
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Table  of  Di stances  in  Kilometers  for 
Separation  Distance  of  7.0  km 
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TABLE  OF  DISTANCES  IN  KM. 

FCR  SEPARATION  DISTANCE  OF  9.0  KM 
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Table  11. 

significant  question  is  the  dispersion  required  between  the  elements  to 
alleviate  bonus  damage*  It  is  this  question  that  the  algorithm  in  this  report 
is  well-suited  to  address*  A  similar  question  arises  in  the  case  of  the 
desirable  separation  between  redundant  command  posts  in  an  architecture  in 
which  survivaiblity  is  increased  by  simply  multiplying  the  number  of  command 
posts;  this  increase  in  survivability  is  then  traded  off  with  the  cost  of 
duplication. 

The  answer  to  the  question  of  optimal  separation  depends  on  the 
environmental  criteria  pertinent  to  the  units  under  study.  For  example, 
suppose  the  command  posts  are  designed  to  withstand  2600  rad  total  radiation 
dose  to  personnel,  3  psi  peak  static  overpressure,  and  104  v/m  EMP  vertical 
electric  field.  (The  effects  of  thermal  on  personnel  are  not  considered  since 
most  personnel  in  a  command  post  should  be  in  a  trailer  or  other  protected 
structure.)  By  means  of  the  environment  formulas  discussed  in  section  2, 
these  criteria  occur  at  a  specific  distance,  D,  from  a  weapon  burst  of  a  given 
yield.  For  example,  the  formula  for  peak  static  overpressure  is  given  by: 

Ap  (psi)  -  1.61  /  "1/3)  1,70  ^ 

'  w  •  1  /3 

for  a  weapon  of  yield  w  in  KT  at  a  scaled  height  of  burst  of  60  w  '  m.  Other 
formulas  may  be  found  in  the  cited  reference.  Suppose  the  likely  threats  to 
the  command  posts  are  300-  and  600-kT  weapons  delivered  with  a  total  standard 
deviation,  0,  of  500  m,  and  that  bonus  damage  is  to  be  precluded  with  a 
confidence  C  -  0.95.  The  environmental  criteria  then  correspond  to  the 
ranges,  D,  shown  in  Table  12;  and  interpolation  between  the  appropriate  Tables 
1  through  11  with  D  as  the  search  variable  produces  the  separation  distances, 
S,  shown. 


w  * 

300  kT 

w  =  600  kT 

D 

S 

D 

S 

Criterion 

(km) 

(km) 

(km) 

(km) 

2600  rad 

1.9 

2-7  | 

2.1 

2.9 

3  psi 

4.6 

5.2 

5.9 

6.7 

10 4  v/m 

3,2 

4.0 

3.6 

4.2 

Table  12.  Separation  distance  calculations 


As  can  be  seen,  overpressure  is  the  dominant  of  the  environments 
considered;  and  a  separation  distance  of  at  least  6.7  km  between  the  command 
post  elements  or  the  redundant  command  posts  is  required  to  preclude  bonus 
damage  from  the  assumed  threat  with  95%  confidence. 
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4.2  Weapon  Employment 


In  this  example  Blue  targeteers  are  planning  an  attack  strategy  against 
particular  Red  radars  integral  to  air  defense  units  which  are  typically 
located  only  2  to  2.5  km  apart.  The  Blue  weapon  to  be  used  against  the  radars 
is  a  50-kT  missile  which  is  delivered  at  two-thirds  maximum  range  with  a 
standard  deviation,  o,  of  250  m.  The  dominant  kill  mechanism  to  the  radars  is 
blast  and  the  peak  static  overpressure  damage  criterion  is  2.4  psi.  This 
overpressure  level  occurs  at  a  distance,  D,  of  2.91  km  from  such  a  weapon 
burst. 

For  a  50%  level  of  confidence,  interpolation  in  Tables  1  through  11  gives 
a  separation  distance,  S,  of  2.89  km,  as  that  distance  at  which  the 
probability  is  50%  that  bonus  damage  at  the  adjacent  target  will  exceed  the 
overpressure  criterion.  Since  the  air  defense  units  are  pairwise  separated  by 
less  than  2.89  km,  there  is  a  greater  than  50%  probability  that  firing  a  round 
at  one  of  the  Red  radars  will  also  produce  blast  damage  to  the  radar  located 
in  the  neighbor  unit. 


4.3  Hardening/Operational  Deployment  Trade-off 

This  example  examines  the  effectiveness  trade-off  between  the 
operational  separation  maintained  between  Army  units  of  a  certain  type  and  the 
hardening  of  the  equipment  in  these  units  to  various  levels  of  nuclear 
blast.  Suppose  the  likely  Red  threat  to  this  type  of  unit  is  a  600  kT  weapon 
delivered  with  a  total  standard  deviation,  a,  of  500  m.  The  dominant  kill 
mechanism  to  this  unit  is  taken  to  be  peak  static  overpressure,  Ap,  to 
various  critical  pieces  of  equipment  in  the  unit. 

Three  levels  of  hardening  of  this  equipment  will  be  considered: 

•  1  psi  (sure-safe  criterion) 

•  2  psi 

•  5.8  psi  (man- survivability  criterion) 

The  distances,  D,  at  which  these  levels  of  Ap  are  encountered  are  calculated 
from  equation  (3)  and  presented  in  Table  13.  If  it  is  desired  to  preclude 
bonus  damage  to  adjacent  units  of  the  same  type  at  a  confidence  level  of  95%, 
then  interpolation  in  Tables  1  through  11  yields  the  separation  distances,  S, 
which  must  be  maintained  between  the  units,  as  shown  in  Table  13. 


Blast 

Criterion 

(psi) 

D 

(km) 

S 

(km) 

1  ■ 

11.2 

12.0 

2 

7.4 

8.2 

5.8 

4.0 

4.8 

Table  13.  Hardening  level  vs*  separation  distance 
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Consequently  if  the  equipment  in  these  units  is  vulnerable  to  blast  levels 
of  1  psi,  then  the  units  must  be  located  at  least  12  tan  apart  to  ensure  that 
there  is  no  more  than  a  5%  probability  that  a  round  aimed  at  one  such  unit 
could  not  only  destroy  that  unit  but  also  inflict  bonus  damage  on  the  neighbor 
unit. 


On  the  other  hand,  if  the  units  are  hardened  to  the  man- survivability 
level  of  5.8  psi,  then  they  can  be  deployed  as  close  as  4.8  km  apart  and  be 
protected  from  bonus  damage  (with  95%  confidence) . 


5.  CONCLUSIONS 


In  conclusion,  this  paper  has  presented  an  algorithm  for  assessing  bonus 
damage  which  accounts  for  weapon-delivery  errors  and  target-location  error, 
and  attaches  confidence  bounds  to  the  results  derived.  This  methodology  is 
applicable  to  a  wide  spectrum  of  problems,  examples  of  which  have  been 
provided;  the  reader  should  be  able  to  discover  other  appropriate  problems 
from  his  own  experience. 
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A  GENERALIZED  RAYLEIGH-RITZ  METHOD  FOR  STRUCTURAL  DYNAMICS 
PROBLEMS  IN  CONJUNCTION  WITH  FINITE  ELEMENTS 

Julian  J,  Wu 

U.S.  Army  Armament,  Munitions,  and  Chemical  Command 
Armament  Research  and  Development  Center 
Large  Caliber  Weapon  Systems  Laboratory 
Benet  Weapons  Laboratory 
Watervliet,  NY  12189 


ABSTRACT.  A  solution  formulation  of  generalized  Rayleigh-Ritz  method  is 
described  and  applied  to  two  initial  and  boundary  value  problems  of  stress 
waves  and  structural  dynamics  in  conjunction  with  finite  element  discretiza¬ 
tion.  Excellent  numerical  results  have  been  obtained  for  wave  equations 
associated  with  lateral  and  longitudinal  vibrations  and  with  strong  disconti¬ 
nuities. 

I.  INTRODUCTION.  This  paper  describes  a  solution  formulation  for  and 
its  applications  to  initial  boundary  value  problems  of  structural  dynamics  and 
stress  waves.  Excellent  numerical  results  are  stated  in  conjunction  with 
finite  element  discretization.  The  basic  concept  of  this  approach  is  to 
establish  a  variational  problem  equivalent  to  a  given  initial  boundary  value 
problem,  which  is  in  general,  non-self -adjoint ,  through  the  use  of  an  adjoint 
field  variable  and  the  use  of  some  large  "spring"  constants  so  that  all  the 
end  conditions  can  be  transformed  into  natural  "boundary"  conditions. 
Therefore,  the  shape  functions  used  need  not  satisfy  any  end  conditions  a 
priori  in  solving  the  variational  problem  in  the  same  manner  as  applying  the 
Rayleigh-Ritz  method  for  self-adjoint  problems.  This  same  concept  was 
demonstrated  In  solving  initial  value  problems  in  a  paper  delivered  at  the 
International  Symposium  on  Numerical  Methods  in  Engineering  Science  series  in 
1978  and  later  published  In  the  Journal  of  Sound  and  Vibration  [1],  In  this 
present  paper,  the  formulation  is  extended  to  initial  boundary  value  problems 
and  the  numerical  results  obtained  are  also  encouraging. 

In  the  section  which  follows  immediately,  two  Initial  boundary  value 
problems  are  stated.  One  is  a  longitudinal  stress  wave  problem  In  a  rod. 

There  is  a  discontinuity  in  the  initial  data  given.  Wie  wish  to  trace  this 
discontinuity  in  the  numerical  solution  using  the  present  approach.  The 
second  problem  is  a  beam  vibration  problem  under  a  moving  concentrated  load. 
This  is  a  much  more  difficult  problem  since  the  partial  differential  equation 
is  of  fourth  order  and  the  force  is  singular  in  nature.  In  the  next  section, 
variational  problems  equivalent  to  the  given  initial  boundary  problems  are 
established.  The,  finite  element  discretization  procedures  are  then  briefly 
recaptured.  Lastly,  numerical  results  are  presented  with  some  comments, 

II .  INITIAL  BOUNDARY  VALUE  PROBLEMS .  Two  initial  boundary  problems  of 
structural  dynamics  will  be  stated  in  this  Section.  The  first  one  is  of 
longitudinal  elastic  stress  wave  in  a  rod  with  a  sudden  change  in  initial 
conditions.  The  second  one  is  concerned  with  lateral  vibrations  of  a 
Euler-Bemoulli  beam  subjected  to  a  moving  concentrated  load. 
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Longitudinal  Stress  Wave  In  a  Rod.  The  rod  is  fixed  at  one  end  and  free 
at  the  other  end.  The  discontinuity  data  arises  from  the  initial  linear 
displacement,  corresponding  to  a  constant  stress,  due  to  a  force  applied  at 
the  "free"  end.  This  force  suddenly  disappears  at  time  zero  causing  a  stress 
discontinuity  at  the  free  end.  The  differential  equation  can  be  written  as: 


with 


1_  3*u 
3x2  a2  3t2 


0  <  x  <  SL 
0  <  t  <  T 


(1) 


a2  =  E/p 


(2) 


where  u  -  u(x,t)  is  the  axial  displacement;  x,t  are  the  coordinates  in  axial 
direction  and  in  time,  respectively;  P,E  are  density  and  Young’s  modulus, 
respectively,  of  the  rod  material;  l  denotes  length  of  the  rod;  and  T  denotes 
some  finite  time  of  interest. 


For  the  boundary  conditions,  we  have 

u(0,t)  -  0 

and 

3U  (3) 

“  (i,t)  -  0 

Sx 


The  dynamics  of  the  problem  are  due  to  the  initial  conditions.  It  is  assumed 
that  the  rod  is  stretched  to  a  linear  displacement  by  a  force  P  which  vanishes 
at  time  t  >  0  (see  Figure  1).  The  initial  velocity  of  the  rod  is  assumed  to 
be  zero.  Thus 


and 


P 

u(x,0)  *  —  x  ;  and 
AE 

3u 

—  (x,0)  -  0 


(4) 


It  is  convenient  to  use  dimensionless  parameters.  Let 

u*  =■  u/£  ,  x*  =  x/i  ,  t*  -  t/T 

Then,  Eq.  (1)  in  dimensionless  form  is 

32u*  2  32u*  0  <  x*  <  l 

3x*2  3t*2  *  o  <  t*  <  1 


(5) 


(6) 
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where 


The  boundary  conditions  become 

3u* 

u*(0,t*)  =  0  ,  -  (l,t*)  =  0 

3x* 

and 

3u* 

u*(x,0)  =  P*x*  ;  —  (x*,0)  *  0 

3t* 

where 

P 

P*  =  — 

AE 


(7) 

(8) 

(9) 

(10) 


is  the  force  in  dimensionless  form. 


Figure  1.  A  Rod  Fixed  at  One  End  and  Subjected  to  a  Load  P,  which  is 
Suddenly  Released  at  Time  Zero. 
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The  stated  problem  in  dimensionless  form  combines  Eqs.  (6),  (8),  and  (9) 
with  the  new  dimensionless  parameters  related  to  physical  counterparts  by  Eqs, 
(5),  (6),  and  (10).  To  simplify  writing,  we  shall  drop  the  asterisks  (*)  in 
Eqs.  (6),  (8),  and  (9),  and  rewrite  them  as 


0  <  x  <  1 

u”  -  b2u  =0  ; 

0  <  t  <  1 
u(0ft>  -  0  ;  u'(l,t)  -  0 

u(x,t)  =  Px  ;  u(x,0)  -  0 


(6*) 

(8’) 

(9*) 


where  a  prime  (f)  indicates  differentiation  with  respect  to  x  and  a  dot  (*), 
with  respect  to  t. 

Beam  Vibrations  Under  Moving  Loads.  Let  us  consider  the  differential 
equation  of  a  uniform  Euler-Bernoulli  beam  subjected  to  a  moving,  concentrated 
force. 


0  <  x  <  l 

Ely”'*  +  pAy  =  P5(xp-x)  (11) 

0  <  t  <  T 


where 


E,P 

= 

Young’s  modulus,  density  of  the  beam  material 

I.A 

= 

second  moment,  area  of  the  beam's  cross-section 

£ 

= 

length  of  the  beam 

y=y(x,t) 

a 

beam  deflection 

x,t 

s 

coordinates  in  beam’s  axial  direction  and  in  time 

P 

a 

magnitude  of  the  concentrated  force 

«(x) 

s 

Dirac  delta  function 

Xp=Xp(t) 

= 

location  of  P 

T 

some  finite  time  of  interest 

Again  it  will  be  convenient  to  employ  nondimens ional  parameters  and 
equations.  These  will  be  introduced  by  way  of  Eq,  (11),  Thus,  let 


y*  =  y/ i  ,  x*  =  x/£  , 

Using  Eq,  (12)  in  Eq.  (11),  one  has 

y  *"’*  +  Y2y*  =  Q6(xp*-x*) 


t*  -  t/T 

0  <  x*  <  1 
0  <  x*  <  1 


where 


Q  »  P* 


P£2 

El 


(12) 

(13) 


(H) 
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Also  note  in  Eq.  (13)  that  the  differentiations  are  now  with  respect  to  the 
nonditnensionalized  variables  x*  ant  t*.  From  now  on,  we  shall  use  Eq.  (13) 
with  the  asterisks  dropped  altogether. 


0  <  x  <  1 

y“"  +  ky  +  y2y  =  QS(xp-x)  (15) 

0  <  t  <  I 

Ill.  VARIATIONAL  PROBLEMS  -  A  GENERALIZED  RAYLEIGH-RITZ  METHOD  *  For  the 
stress  wave  problem  in  the  previous  section,  consider  a  variational  problem. 

<$IQ  =  0  (16a) 

with 

I0  58  !o^u»v)  “  /Q  Jq  (-u,v,+b2uv)dxdt  (16b) 

where  u(x,t)  and  v(x,t)  are  said  to  be  adjoint  to  each  other.  It  Is  a  simple 
matter  to  see  that  this  problem  is  an  indeterminate  one.  However,  the 
functional  of  Eq,  (16b)  can  be  modified  to  a  variational  problem  which  is 

equivalent  to  the  boundary/ Initial  problem  of  Eqs.  (6'),  (8*),  and  (9*).  Thus 

consider 

<51  -  0  (17a) 

with  ii 

I  =  I(u,v)  -  /  /  (-u,v*+b2uv)dxdt 

0  0 

,1 

+  ki/  u(0,t)v(Q,t)dt 

0 

+  k2b2/^  [u(x ,0)  -  u0(x)]v(x,l)dx  +  b2  ui(x)v(x,0)dx  (17b) 

We  shall  take  the  first  variation  of  the  function  I(u,v)  of  Eq.  (17b)  in 

such  a  manner  that  <5v  Is  completely  arbitrary  while  <5u  Is  set  to  zero 

identically.  Hence,  by  means  of  integration-by-parts,  one  has 

1  A 

(6I)<$u=o  "  /q  Jq  (u"-b2u)  6vdxdt 

A 

- ;  u'(i,t)Mi,t)dt 

1 

+  f  [u(0,t)  +  kiu(0,t)]6v(0,t)dt 
+  b2/^  tu(x,l)  +k2[u(x,0)  -  u0(x) J }6v(x,l)dx 

9  A  r. 

-  b  /  [u(x,0)  -  ui(x) ] Sv(x,0)dx  =  0  (18) 
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The  fact  that  Sv(x,t)  is  completely  arbitrary  enables  us  to  conclude  from  Eq. 
(18)  that 

0  <  x  <  1 

u"  -  b2u  =  0  ;  (19a) 

0  <  t  <  1 


u'( l,t)  -  0 
u’COjt)  +  kiu(0,t)  -  0 

u(x,l)  +  k2[u(x,0)  -  u0(x)]  =*  0  (19b) 

and 

u(x,0)  -  ui(x)  =*  0 

It  Is  then  observed  that  the  initial  boundary  value  problem  defined  by  Eqs. 
(19a)  and  (19b)  reduces  to  that  of  Eqs,  (6»),  (8f),  and  (9*)  if  one  lets  kj 
and  k2  go  to  infinity*  (and  with  uq(x)  -  Px  and  ui(x)  =  0).  This  fact 
suggests  that  the  variational  problem  of  Eqs.  (17a)  and  (17b)  can  be  used  as  a 
basis  of  a  finite  element  discretization  for  the  approximate  solutions  to  the 
original  initial  boundary  problem.  It  should  be  noted  that  all  the  auxiliary 
conditions  in  Eqs.  (19a)  and  (19b)  are  the  so-called  natural  boundary 
conditions.  They  are  the  consequence  of  the  variational  problem  -  just  like 
the  differential  equation  itself.  For  this  reason,  the  above  solution  is 
referred  to  as  a  Generalized  Rayleigh-Ritz  Method, 

By  a  similar  process,  one  can  establish  a  variational  problem  for  the 
vibration  problem  of  a  beam  under  a  moving  load.  In  this  case,  one  has 


51  ■  [u" Sv"  -  u6v  -  5(x-x) 5v]dxdt 

,1 

+  /  [kiu(0,t)6v(0,t)  +  k2Uf (0,t)6v? (0,t) 


+  k3u(l  ,t)<5v(  l,t)  +  k4uf(  1  ,t)6v*  (  1  »t)  ]dt 


+ 


[k5u(x,0)6v(x,l)  +  k6u(x,0)5v(x,0) ]dx 


0 


(20) 


*This  process  is  sometlnes  referred  to  as  the  penalty  function  method.  See, 
for  example.  Reference  [2], 
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Through  integrations-by-parts, 


,1  rl  "  “  - 

51  =  /  [u"M  +  u  -  6(x-x) ] 6v(x ,t)dxdt 

+  J*  { [klu(0,t)  +  u"f(0,t)]5v(0,t)  +  [k2u* (0,t)  -  u’*(0,t) J 6v' (0,t) 
+  [k3u(l,t)  -  u",(l,t)]6v(l,t)  +  [k4u'(l,t)  +  u"( 1 , t) ] Sv' ( 1 ,t) >dt 


+  / 


1 

0 


( [k5(u(xf0)-0)  -  u(x,t)]6v(x,l)  +  (k&+l)  [u(x,0)-0]  5v(x,0)  }dx 


0  (21) 


The  original  differential  equation  and  the  boundary  and  initial  conditions  are 
recovered  from  the  equation  above  due  to  the  arbitrariness  of  the  variations 
6(x,t)  and  by  properly  selecting  the  values  of  k-^,3,  i  =  1,2,..., 6. 


IV.  FINITE  ELEMENT  DISCRETIZATION.  Only  essential  features  will  be 
stated  in  the  finite  element  discretizations  here.  The  region  of  a  unit 
square  (0  <  x  <  1;  0  <  t  <  1)  is  further  divided  into  KxL  rectangles  by  taking 
K  divisions  in  x  direction  and  L  divisions  in  t  direction.  Local  coordinates 
(£,n)  in  each  (i,j)th  element  are  related  to  (x,t)  by  these  equations*. 


£  =  -  Kx  -  i  +  1 

n  =  n(j)  *  Lt  -  j  +1 


(22) 


Within  each  element,  the  unknown  function  u(x,t)  is  replaced  by  the 
approximation: 


“  aTU,n)  U(i,j)  ( 

5v(i,j)(5»n>  "  aT(C,n)  SVd.^) 

where  a(£,ri)  is  the  shape  function  vector  and  U(^  j),  6V(^  j)  are  the 
generalized  coordinates.  The  specific  form  of  a(|,n)  employed  here  is  such 
that  each  one  of  the  sixteen  components  is: 


k  =  1,2 . ,16 

*kU>n)  =  bi(C)bj  (n)  ,  (24) 

I,j  =  1*2, 3, 4 


with 

bi(e)  =  1  -  352  +  2£3 
b2(S)  =  5  -  2?2  +  E3 
b3(S)  =  352  -  253 
HU)  -  -42  +  53 


(25) 
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and  the  relations  between  index  k  and  the  pair  (i,j)  are  given  in  Table  I. 
TABLE  I.  RELATIONSHIP  BETWEEN  (i,j)  AND  k  IN  EQUATION  (24) 


n 

|HH 

*■ 

(i,j) 

n 

(1,3) 

(2,1) 

Bft: 

(2,3) 

(1,2) 

11 

(1,4) 

(2,2) 

12 

(2,4) 

(3,1) 

13 

(3,3) 

(4,0 

14 

(4,3) 

(3,2) 

15 

(3,4) 

1  8 

(4.2) 

_ 16__, 

(4.4) 

Using  Eqs.  (22)  through  (25)  in  Eq.  (17)  and  the  fact  that  is 

completely  arbitrary,  the  matrix  equations  for  the  unknowns  U^^j)  can  be 
routinely  assembled  and  solved.  Further  details  will  be  omitted  here. 

V.  NUMERICAL  RESULTS  AND  DISCUSSION.  Some  of  the  numerical  results  are 
presented  in  this  section.  For  the  stress  wave  problem*,  Table  II  provides 

solutions  of  v(x,t),  3u/3x(x,t)  and  9u/3t(x,t)  for  x  -  0,  0.1,  0.2,  ...1.0  and 

for  t  =»  0,  0.5,  1.0,  1.5,  and  2.0.  During  this  time  interval,  the  original 
displacement  has  gone  through  a  complete  sign  reversal  as  shown  in  Figure  2. 

This  particular  set  of  data  was  obtained  by  taking  K  =  10  and  L  *  1  with 

restart  procedures,  i.e.,  the  final  solution  in  the  first  time  step  was  taken 
as  the  initial  condition  of  the  next  step  in  time,  and  so  on.  Values  of  the 
exact  solutions  are  given  in  parentheses.  Excellent  agreement  is  observed. 

The  fact  that  the  discontinuity  of  the  solution  follows  along  without  much 
oscillation  is  worth  mentioning. 

For  the  beam  vibration  problem  with  a  moving  force,  some  typical 
numerical  solutions  are  given  in  Tables  III  and  IV.  The  moving  concentrated 
force  is  assumed  to  travel  at  a  constant  velocity  c  (although  this  is  not  at 
all  a  restriction  for  the  present  method)  such  that 

x(t)  -  ct 

where  c  is  dimensionless  velocity.  For  small  c,  c  =  0.0001,  and  the 
displacement  solutions  become  those  of  static  deflections  as  shown  in  Table 
III.  For  a  large  c  (compared  with  unity),  c  ~  10,  and  solutions  show  dynamic 
effects  as  Indicated  in  Table  IV,  As  a  comparison,  solutions  obtained  by  the 
Fourier  series  and  Laplace  transform  method  [4]  are  given  in  parentheses. 

Good  agreement  exists  even  in  cases  with  considerable  dynamic  effect. 


*For  exact  solution  to  this  problem,  see  for  example,  Reference  [3], 
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TABLE  II.  SOLUTIONS  TO  THE  STRESS  WAVE  PROBLEM  OF  EQS.  (6'),  (8')  and  (9')  WITH  b  =  1.0,  P  =  1.0 

(PART  L) 


Data  at  Time  t  s 

=  0.0 

Data  at  Time  t  = 

3.50 

Data 

at  Time  t  = 

1.00 

X 

u(x»t) 

3u/3x 

3u/3t 

u(x,t) 

3u/3x 

3u7"3t 

u(xft) 

3u/3x 

3u75t 

0.0 

0.00000 

(0.00000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

-0.00000 

(0.00000) 

0.99861 

(1.00000) 

0.00000 

(0.00000) 

0.00000 

(0.00000) 

-0.11718 

(0,00000) 

0.00000 

(0.00000) 

0.10 

0.10000 

(0.10000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.09998 

(0.10000) 

0.99740 

(1.00000) 

-0.00142 

(0.00000) 

-0.01236 

(0.00000) 

0.18416 

(0.00000) 

-1.07661 

(-1.00000) 

0.20 

0.20000 

(0.20000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.19994 

(0.20000) 

0.99113 

(1.00000) 

-0.00618 

(0.00000) 

0.00259 

(0.00000) 

-0.10702 

(0.00000) 

-1.09024 

(-1.00000) 

0.30 

0.30000 

(0.30000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.29949 

(0.30000) 

0.97359 

(1.00000) 

-0.01886 

(0.00000) 

-0.00026 

(0.00000) 

-0.00077 

(0.00000) 

-0.92271 

(-1.00000) 

0.40 

0.40000 

(0.40000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.40354 

(0.40000) 

1. 06038 
(1.00000) 

0.07965 

(0.00000) 

0.00035 

(0.00000) 

-0.00953 

(0.00000) 

-1.0347  6 
(-1.00000) 

0.50 

0.50000 

(0.50000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.49976 

(0.50000) 

0.47813 

(1.00000) 

-0.54638 

(0.00000) 

-0.00036 

(0.00000) 

-0.00287 

(0.00000) 

-0.95721 

(-1.00000) 

0.60 

0.60000 

(0.60000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.49785 

(0.50000) 

0.02932 

(0.00000) 

-0.96310 

(-1.00000) 

0.00042 

(0.00000) 

-0.00334 

(0.00000) 

-1.04064 

(-1.00000) 

0.70 

0.70000 

(0.70000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.50081 

(0.50000) 

-0.02913 

(0.00000) 

-1.06490 

(-1.00000) 

-0.00039 

(0.00000) 

1  0.00046 
(0.00000) 

-0.95841 

(-1.00000) 

0.80 

0.80000 

(0.80000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.49983 

(0.50000) 

0.00258 

(0.00000) 

-0.95849 

(-1.00000) 

0.00041 

(0.00000) 

-0.00086 

(0.00000) 

-1.04185 

(-1.00000) 

0.90 

0.90000 

(0.90000) 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.50019 

(0.50000) 

0.00036 

(0.00000) 

-1.03927 

(-1.00000) 

-0.00040 

(0.00000) 

-0.00012 

(0.00000) 

-0.95859 

(-1.00000) 

1.00 

1.00000 

(1.00000) 

0.00000 

(0.00000) 

0.00000 

(0.00000) 

0.49982 

(0.50000) 

-0.00000 

(0.00000) 

-0.96235 

(-1.00000) 

0.00041 

(0.00000) 

0.00000 

(0.00000) 

-1.04164 

(-1.00000) 

^Figures  in  parentheses  ind 


II cate  exact  solutions 
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TABLE  II.  SOLUTIONS  'TO  THE  STRESS  WAVE  PROBLEM  OF  EQS.  (6'),  (81  ) ,  and  (9')  with  b  =  1.0*  P  =  1.0. 

(PART  2) 


Data  at  Time  t  = 

1.50 

Data  at  Time  t  - 

i.00 

X 

u(x,t) 

3u/3x 

3u/3t 

u(x,t) 

3u/3x 

3u/3 1 

0.0 

0.00000 

-0.99528 

0.00000 

0.00000 

-0.02019 

-0.00000 

(0.00000) 

(-1.00000) 

(0.00000) 

(0.00000) 

(-1.00000) 

(0.00000) 

0.10 

-0.9975 

-1.01418 

0.00235 

-0.10003 

-0.98040 

0.00165 

(-0.10000) 

(-1.00000) 

(0.00000) 

(-0.10000) 

(-1.00000) 

(0.00000) 

0.20 

-0.19988 

-0.93268 

-0.04972 

-0.19992 

-0.01911 

-0.00119 

(-0.20000) 

(-1.00000) 

(0.00000) 

(-0.20000) 

(-1.00000) 

(0.00000) 

0.30 

-0.30222 

-1.13452 

0.09812 

-0.30008 

-0.97758 

0.00610 

(-0.30000) 

(-1.00000) 

(0.00000) 

(-0.30000) 

(-1.00000) 

(0.00000) 

0.40 

-0.39324 

-0.62190 

-0.02522 

-0.39981 

-0.00984 

-0.00308 

(-0.40000) 

(-1.00000) 

(0.00000) 

(-0.40000) 

(-1.00000) 

(0.00000) 

0.50 

-0.49607 

0.19515 

-0.40290 

-0,50035 

-0.98060 

-0.02259 

(-0.50000) 

(0.00000) 

(0.00000) 

(-0.50000) 

(-1.00000) 

(0.00000) 

0.60 

-0.51154 

0.00219 

-1.14247 

-0,59945 

-1.01522 

0.02001 

(-0.50000) 

(0.00000) 

(-1.00000) 

(-0,60000) 

(-1.00000) 

(0.00000) 

0.70 

-0.49849 

-0.00219 

-1.05659 

-0.69929 

0.84502 

-0.05514 

(-0.50000) 

(0.00000) 

(-1.00000) 

(-0.70000) 

(-1.00000) 

(0.00000) 

0.80 

-0.50034 

-0.02908 

-0.92639 

-0.80180 

-0.84502 

0.07546 

(-0.50000) 

(0.00000) 

(-1.00000) 

(-0.80000) 

(-1.00000) 

(0,00000) 

0.90 

-0.49935 

-0.01126 

-1.03406 

-0.90529 

-1.19080 

-0.23643 

(-0.50000) 

(0.00000) 

(-1.00000) 

(-0.90000) 

(-1.00000) 

(0.00000) 

1.00 

-0.50051 

-0.00000 

-0.93441 

-0.98802 

-0.00000 

0.31500 

(-0.50000) 

(0.00000) 

(-1.00000) 

(-1.0000) 

(-1.00000) 

(0.00000) 

*Figures  in  parentheses  indicate  exact  solutions. 
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TABLE  III.  SOLUTIONS  u(x,t)  TO  THE  MOVING  FORCE  PROBLEM  OF  E Q.  (15) 
WITH  Q  =  1,0  AND  FIXED  END  CONDITIONS  AT  x  =  0. 

(For  very  low  velocity,  f  =  10“**  in  Eq.  (14)) 


X 

t  =  0.0 

t  =  0.20 

t  =■  0.40 

t  =  0.60 

t  =  0.80 

t  =  1.00 

0.0 

0.000000 

(Given) 

0.000000 

(0.000000) 

0.000000 

(0.000000) 

0.000000 

(0.000000) 

0.000000 

(0.000000) 

-0.000000 

(0.000000) 

0.20 

0.000001 

(Given) 

0.008533 

(0.009534) 

0.011999 

(0.012000) 

0.010665 

(0.010667) 

0.006125 

(0.006133) 

-0.000097 

(0.000000) 

0.40 

-0.000001 

(Given) 

0.012001 

(0.012000) 

0.019206 

(0.019199) 

0.018157 

(0.018134) 

0.010763 

(0.010666) 

0.001174 

(0.000000) 

0.60 

-0.000000 

(Given) 

0.010668 

(0.010667) 

0.018137 

(0.018133) 

0.019214 

(0.019201) 

0.012057 

(0.012000) 

0.000691 

(0.000000) 

0.80 

0.000001 

(Given) 

0.006133 

(0.006134) 

0.010664 

(0.010666) 

0.011990 

(0.012000) 

0.008491 

(0.008533) 

-0.000509 

(0.000000) 

1.00 

-0.000000 

(Given) 

0.000000 

(0.000000) 

0.000000 

(0.000000) 

0.000000 
(b. oooooo) 

0.000000 

(0.000000) 

0.000000 

(0.000000) 

Solutions  in  parentheses  based  on  formulas  from  Reference  4 
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TABLE  IV.  SOLUTIONS  u(x,t)  TO  THE  MOVING  FORGE  PROBLEM  OF  EQ.  <15) 
WITH  Q  «  1.0  AND  FIXED  END  CONDITIONS  AT  x  =  0. 

(For  very  low  velocity,  Y  *  10  in  Eq.  (14)) 


X 

t  -  0.0 

t  =  0.20 

t  -  0.40 

t  =  0.60 

t  =  0.80 

t  -  1.00 

0.0 

-0.000000 

(Given) 

-0.000000 

(0.000000) 

-0.000000 

(0.000000) 

-0.000000 

(0.000000) 

-0.000000 

(0.000000) 

-0.000000 

(0.000000) 

0.20 

-0.000001 

(Given) 

0.000482 

(0.000467) 

0.001387 

(0.001345) 

0.002151 

(0.002046) 

0.002944 

(0.002534) 

0.007191 

(0.003643) 

0.40 

-0.000001 

(Given) 

-0.000077 

(-0.000082) 

0.001109 

(0.001109) 

0.002717 

(0.002704) 

0.004364 

(0.004375) 

0.004877 

(0.004463) 

0.60 

-0.000000 

(Given) 

0.000001 

(0.000025) 

-0.000320 

(-0.000311) 

0.001493 

(0.001504) 

0.003110 

(0.003177) 

0.005067 

(0.005601) 

0.80 

-0. 00000 L 
(Given) 

0.000003 

(-0.000012) 

-0.000013 

(0.000002) 

-0.000964 

(-0.000942) 

0.001308 

(0.001257) 

0.005751 

(0.005464) 

1.00 

-0.000000 

(Given) 

-0.000000 

(0.000000) 

0.000000 

(0.000000) 

-0.000000 

-(0.000000) 

-0.000000 

(0.000000) 

-0.000000 

(0.000000) 

Solutions  in  parentheses  based  on  formulas  from  Reference  4 


In  conclusion,  this  paper  has  demonstrated  through  examples  of 
structural  dynamics  an  approximate  solution  formulation  (which  is  both  a 
weighted  method  and  a  variational  problem),  the  finite  element  implementation, 
and  some  favorable  numerical  results.  Although  only  linear  problems  have  been 
mentioned,  an  application  to  solutions  of  non-linear  problems  is  now  being 
investigated. 
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A  Numerical  Technique  in  the  Hydrodynamic 
Theory  of  Foil  Bearings 

I.G.  Tadjbakhsh*,  G,  Ahmadi**  and  E.A.  Saibel+ 

Abstract 

The  hydrodynamic  theory  of  foil  bearings  is  reviewed.  The 
relationship  between  fluid  pressure  and  film  thickness  is  discussed. 

The  compressibility  of  gas  is  included  in  the  analysis.  It  is  shown 
that  the  basic  equation  for  determination  of  pressure  distribution 
becomes  a  third  order  boundary  value  problem  in  terms  of  film  thick¬ 
ness.  A  simple  numerical  scheme  for  solution  of  the  nonlinear  boundary 
value  problem  is  developed  and  some  examples  are  considered  and  dis¬ 
cussed. 
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1*  Introduction 


Theory  of  hydrodynamic  lubrication  is  one  of  the  well  established 
fields  of  mechanical  engineering  [1,2],  The  theory  of  hydrodynamic 
lubrication  with  deformable  boundaries  was  considered  by  Korovchinskii 
[3],  Christensen  [4,5],  Wilson  [6]  and  Mahdariaw  and  Wilson  [7],  among 
others. 

The  theory  of  foil  bearing  was  first  investigated  by  Blok  and  Van 

-  i 

Rossum  [8]*  Wildman  and  Wright  [9]  have  considered  the  effect  of  external 
pressure  on  foil  bearings  and  have  also  employed  a  perturbation  method 
for  solution  of  the  resulting  equations.  Further  developments  are  carried 
out  by  Eshel  and  Elrod  [10],  Ma  [11],  Barlow  [12]  and  more  recently  by” 
Eshel  [13,14]. 

In  the  present  investigation,  the  theory  of  gas  lubrication  with 
a  flexible  boundary  is  studied.  The  equations  of  motion 
of  a  flexible  tape  are  considered  and  under  the  assumption  of  small 
slope  a- simple  relationship  between  the  f luid'-pressure  and  film  thick¬ 
ness  is  -established.  The  basic  equation  for  the  variation  of  film  pres¬ 
sure  is  then  obtained  which  is  shown  to  be  a  nonlinear  third  order  two 
point  boundary  value  problem.  The  general  expressions  for  the  load  bearing 
capacity  and  the  friction  force  of  the  bearing  are  derived  and  discussed. 

A  simplified  scheme  for  obtaining  the  numerical  solution  of  the  formulated 
nonlinear  boundary  value  problem  is  developed  and  applied  to  several 
examples. 

2.  Basic  Equations 

We  consider  a  hydrodynamical  bearing  which  is  driven  by  a  moving 
flexible  tape  or  belt  as  shown  in  Fig.  1.  As  a  result  of  the  pressure 
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developed  within  the  lubricating  film  the  flexible  tape  deflects  from 
the  straight  line  position  joining  the  two  roller  bearings  at  the 
entrance  and  exit  of  the  bearing. 

We  assume  that  the  steadily  moving  tape  assumes  a  fixed  shape 
in  space,  i.e.,  the  tape  slides  along  a  fixed  curve  c  in  the  x-y  plane. 
Let  0  be  a  fixed  point  in  the  space  located  also  on  the  curve  of  the 
moving  belt.  Let  O'  be  a  point  on  the  moving  tape  serving  as  a  refer* 
ence  point  for  measurement  of  the  distance  S  of  the  material  points 
along  the  tape  and  which  coincides  with  the  point  0  at  a  reference  time 
tQ.  A  point  P  a  distance  S  away  from  01  along  curve  c  at  time  will 
be  a  distance  5  =  S  +  v(t*tQ)  away  from  0  at  time  t.  The  Cartesian 
coordinates  x  and  y  of  the  point  P  as  well  as  the  tension  x  of  the  tape 
at  that  point  will  be  functions  of  the  variable  The  equations  of 
motion  of  the  tape, assuming  no  resistance  to  bending,  and  inextensibility 
condition  are: 


3S  as*  +  fx  ; 

is 


CD 

(2) 


(— )2  + 


(iZ)2 


1. 


(3) 


Here  f  and  f  are  the  fluid  forces  in  x  and  y  directions  exerted  on  the 
x  y 

tape  and  y  is  the  mass  per  unit  length  of  the  tape. 

Since  bearings  generally  have  small  slopes,  it  can  be  said  that 
g  -  |f  «  1  and  hence  s  1.  Using  this  and  the  fact  that  x,  y 

and  t  are  functions  of  (1)  and  (2)  can  be  written  as 
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dx  ,  c  2  d^x  * 

11  x  “  Y  V  V  °* 


f  =  y  v 

y 


2  d2y 

2* 

ar 


(A) 


(5) 


Under  the  stated  set  of  assumptions,  we  have 


f x  "  2u  3y  ly=h  ' 


(6) 


f  =  “  P  , 

y 


(7) 


in  which  y  is  the  viscosity  of  the  lubricant  and  u  and  p  represent  the  fluid 
velocity  and  excess  (gage)  pressure  as  governed  by  the  Reynolds  equation 


5  u  1  d£ 

3y2  >*  dx‘ 


(8) 


Integrating  equation  (8)  and  using  the  boundary  conditions 


u=Q  at  y=Q  , 


(9) 


9x  .  - 

u  =  at  =  v  at  y  -  h  , 


(10) 


we  obtain 


u  =  ir  £  (y2  - hy)  +  h v- 


(id 


The  continuity  equation  is  given  by 


P*Q  s  )  pu  dy* 
a  o 


(12) 
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where  p  and  p  are  the  densities  of  gas  (air)  at  atmospheric  pressure  P 
and  at  gage  pressure  p,  respectively.  For  a  polytropic  precess,  we  have 


pa  +  P 


P 


n 


=  const. 


(13) 


n  equal  to  1.4  and  1.0  correspond  to  the  adiabatic  and  isothermal  processes, 
respectively  and  n  =  “  denotes  the  isochoric  (incompressible)  gas  flow. 
Using  equation  (11)  in  (12)  and  noting  that  p  is  only  a  function  of  x,  it 
follows  that 


Q  “  (p  )  (2  vh  -  12y  j*) . 

3 

Eliminating  p  between  equation  (13)  and  (14),  we  fim 


(14) 


q-  g>. 


The  linearized  forms  of  equations  (5)  and  (7)  lead  to 


P  '  -Yv 


2  d2h 


dx 


2  * 


Substituting  equation  (16)  into  (15),  we  find 


(15) 


(16) 
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(17) 


(i  _  12Li  A.l/n  (Yv_  .h 
U  p  ,  21  42u 

*a  dx 


3  d\ 
dx3 


+  ~2  vh)  =  Q. 


For  the  incompressible  limit,  that  is,  h  =  »,  equation  (17)  reduces,  to  that  of 
[8,9].  The  boundary  conditions 


h  =  h 

at 

x  = 

0  , 

(18) 

o 

h  =  h 

at 

x  = 

l  , 

(19) 

at 

X  - 

0 ,i  , 

(20) 

dx2 


serve  to  determine  the  flow  rate  Q  and  the  three  constants  of  integration. 


Introducing  dimensionless  variables , 

x 


T  =  * 

T  h  ’ 
o 


n  = 


(21) 


equation  (17)  and  boundary  conditions  (18)  -  (20)  become 


(1 


-l4)1/n  (T3  4  + 


dn 


dri 


j+otT)=$,  0  <  T)  <  1  , 


with 


where 


T  -  1 


T  ~  <5 

d2T 


at 

at 


n 

T» 


0  , 
1  , 


dr\4 


0  at  n  “  0,1  , 


„  =  6y£.  c  12yQi3 

Yvh-*  ’  -Yv^lr 
o  o 


6  =  ^  ,  X= 


YV 


(22) 

(23) 

(24) 

(25) 

(26) 


parameter  X  is  the  ratio  of  dynamic  pressure  to  atmospheric  pressure.  For 
small  values  of  X,  the  gas  behaves  as  an  incompressible  fluid. 
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Equations  (  22)  -  (25)  form  a  nonlinear  third  order  two  point  boundary 
value  problem  for  finding  the  dimensionless  film  thickness  T.  The 
expression  for  the  film  pressure  in  terms  of  the  dimensionless  quantities 
is  given  by 


p  =  - 


(27) 


The  load  bearing  capacity  per  unit  width  is  defined  by 

l 

P  =  /  pdx  .  ~  (28) 

o 


Employing  equation  (27)  in  (28),  the  expression  for  the  load  bearing  capa¬ 
city  in  terms  of  dimensionless  quantities  becomes 


P 


[S (0) 


d x 

dr) 


(1)1 


(29) 


From  equations  (29)  it  is  observed  that  the  bearing  capacity  is  pro¬ 
portional  to  the  square  of  tape  velocity  in  contrast  to  the  case  of  rigid 
boundaries  where  it  becomes  proportional  directly  to  the  velocity.  Further¬ 
more  P  is  related  to,  h^/i,  while  in  the  rigid  case  it  is  proportional  to 
2 

i  /ho.  It  is  of  course  recognized  that  the  terms  dT/dn  in  equation  (29)  depend 
on  the  values  of  parameters  a,B  and  X  which  are  functions  of  v,  hQ,  SL,  p  and 
etc.  Therefore,  the  dependence  of  the  load  capacity  on  various  parameters  would 
be  partially  modified,  accordingly. 

The  friction  force  per  unit  width  of  the  bearing  can  be  calculated  by 
first  obtaining  the  shear  stress  at  the  upper  boundary  and  then  integrating 
the  result  over  the  length  i.  The  final  expression  for  the  frictional 
force  D  becomes 


D 


2T-3B/2a 


dn 


(30) 
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From  equation  (30)  it  is  observed  that  the  dependence  of  D  on  the 

parameters  p,  v,  &  and  is  similar  to  that  of  a  bearing  with  rigid  boundaries. 


3,  Numerical  Solution  and  Examples 

The  numerical  solution  of  the  third  order  nonlinear  two  point 

boundary  value  problem  given  by  equations  (22)  -  (25)  becomes  rather  involved 

'  dT 

The  values  of  the  parameter  3  (which  depends  on  Q)  and  the  slope  — (0)  must 

be  guessed  in  such  a  way  that  at  rj  =  1  the  values  of  T  -  1  and 

d2T 

— =  0  be  reached  simultaneously.  Such  a  procedure  requires  a  time  consuming 
dn 

trial  and  error  calculation.  To  circumvent  the  lengthy  computation  the 


following  scheme  for  solution  is  adopted.  Let  us  assume  that 

f  (0)  =  k  . 

Introducing  a  change  of  independent  variable, 


(31) 


kn  , 


(32) 


equation  (22)  becomes 


(1  &  H-  «•  T>-  S' 


0  <  r  <  k 


(33) 


dr 


dr' 


where 


cl  *  =  k"3a,  B*  =*  k  A1  *  V.\, 


(34) 


The  boundary  conditions  (23)  -  (25)  and  equation  (31)  now  become, 

at  r  =  0  , 


_  _  ,  dT  .  d2T 

i,  dr  =  1,  dr2  "  °* 


T  =  5,  =  0  , 

dr 


at  r  =  k  , 


(35) 

(36) 
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For  fixed  values  of  af,B'  and  X'  (assumed  values)  equation  (33)  can  be 

2  2 

integrated  numerically  with  initial  values  of  T,  dT/dr  and  d  T/dr  given 

2  2 

by  equation  (35),  Integration  is  carried  out  until  d  T/dr  becomes  equal  to 

2  2 

zero.  The  corresponding  r  for  which  d  T/dr  =  0  determines  the  value  of  the 
parameter  k  and  the  value  of  T  at  that  position  gives  the  magnitude  of  the 
parameter  6.  Q  can  then  be  obtained  from  B’.  This  technique,  can  be  used  to 
generate  a  series  of  solutions  for  various  values  fo  the  parameters.  The 
method  is  quite  simple  and  avoids  cumbersome  trial  and  error  procedure. 

Introducing  the  change  of  variable  (32)  into  equations  (27)  and  (29)  the 
expressions  for  the  pressure  and  the  load  bearing  capacity,  respectively, 
become 


P 


P 


Y2vV 


d2T 


2* 


yv2hok 


[1  - 


dr 


goo], 


(37) 

(38) 


A  Runge-Kutta  numerical  integration  scheme  is  employed  and  examples 

are  considered.  In  the  first  example,  the  values  of  parameters  are  taken 

to  be  ctf  =  1,  and  B1  -  0.667  and  X*  =  0.0  (that  correspond  to  an  incompressible 

fluid  and  a  flow  rate  of  Q  -  B'vh  /2a'  =»  vh  / 3).  The  numerical  solution  of 

o  o 

equation  (33)  with  initial  conditions  given  by  equation  (35)  is  obtained  and 

2  2 

it  is  observed  at  r  «  k  =  6.22,  d  T/dr  becomes  approximately  equal  to  zero. 

The  value  of  parameters  a,  B  and  Q  as  found  from  (34)  thus  become 


a  *  240.6,  B  -  160.4,  Q  *  vh  /3.  (39) 

o 

The  corresponding  variation  of  h/hQ  with  rj  is  shown  in  figure  2.  It  is 
observed  that  the  film  thickness  increases  up  to  a  maximum  of  about  three  times 
of  the  entrance  film  thickness  and  then  decreases  to  about  0.25  h  at  the  exit. 
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(40) 


The  variation  of  the  dimensionless  pressure 


is  shown  in  figure  3.  It  is  observed  that  p  reaches  a  peak  of  about  0.9 
very  close  to  the  exit.  The  load  bearing  capacity  of  the  bearing  is  found 
to  be 

2. 

YV  h 

P  =  18.69  — ^  *  (41) 

In  the  second  example  it  is  assumed  that  a*  -  1.6  »  8’  =  1.067  and 

X’  =  0.0.  The  numerical  solution  yields 

k  =  4.64,  6  =  0.26  »  (42) 

It  then  follows  that 

a  =  159.9,  8  =  106.6,  Q  =  vhQ/3  (43) 

The  variations  of  film  thickness  and  dimensionless  film  pressure  with  n  are 
shown  in  figures  4  and  5.  The  load  bearing  capacity  now  is  found  to  be 

P  ,  14.72 — ,  (44) 

It  is  observed  that  the  load  bearing  capacity  decreases  with  an  increase 
of  a  (i.e.,  an  increase  in  gas  viscosity).  As  a  third  example  we  consider 
a  compressible  gas  film  with  parameter  values  a*  =  1.0,  B\  =  0.5,  X*  =  0.1 
and  n  =  1.4.  The  solution  of  the  initial  value  problem  yields  k  *  5.5450  and 
Tr(k)  =  -2.0884.  This  results  in  a  load  bearing  capacity 

yv2h 

P-  17.191  — (45) 

The  film  thickness  and  pressure  variations  are  shown  in  figures  2-7 
and  are  comparable  with  those  obtained  by  a  perturbation  method  in  [9]. 
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4 .  Concluding  Remarks 


In  the  present  work*  hydrodynamic  theory  of  foil  bearing  is 
considered  and  the  effects  on  gas  compressibility  are  studied.  It  is 
observed  that  the  behavior  of  a  foil  bearing  is  quite  different  from 
that  of  a  regular  rigid  boundary  type.  For  instance,  the  film  pressure 
distribution  has  a  sharp  peak  near  the  exit  of  a  foil  bearing  in  contrast 

j 

to  the  relatively  smooth  peak  observed  about  the  middle  of  the 

conventional  rigid  boundary  types.  Furthermore,  the  dependence  of  the 
load  bearing  capacity  of  these  flexible  bearings  is  drastically  different 
from  the  conventional  one. 

Several  important  problems  such  as  optimization  of  the  load  bearing 
capacity,  two  dimensional  effects,  etc.  are  not  treated  here  and  are  left 
to  future  investigations. 
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Fig.  1  -  Sketch  of  the  bearing. 
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Fig.  3  -  Variation  of  dimensionless  film 
pressure  with  n  for  a  -  240.6, 

6  -  160.4. 
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Fig.  S  -  Variation  of  dimensionless  film  pressure 
with  n  for  a  =  159.9,  6  =  106.6  . 


Figure  6  -  Variation  of  film  thickness  with  rt  for  a  =  170.5, 
g  =  85.2  and  \  -  .0032. 
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Figure  7  -  Variation  of  dimensionless  film  pressure  with 
for  a  =  170.5,  0  =  85.2,  X  =  .0032. 
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ABSTRACT:  In  the  past  twD  decades  significant  amounts  of  resources  have  been 
spent  on  the  improvement  and  development  of  Carmand,  Control,  Carmunicatian, 
and  Intelligence  Systems  (C3I) .  The  bureaucratic  institutionalization  of  the 
cost-effectiveness  approach  within  the  Planning,  Programming,  and  Budgeting 
(FPB)  framework  has  contributed  to  the  isolated  assessient  of  C3I  resulting 
in  highly  centralized  systems  that  maximize  the  use  of  modem  technology  for 
its  own  sake.  However,  C3I  is  but  one  of  five  interdependent  principal  elements 
of  defence  systems  requiring  its  asses snent  in  terms  of  how  much  it  contributes, 
together  with  the  other  elements,  to  the  accomplishment  of  the  overall  defence 
mission.  Hereby,  the  canplenentarity  and  the  degree  of  substitutability  between 
the  physical  and  the  conceptual  elements  of  defence  systems  must  be  explicitly, 
considered,  not  the  least  with  a  view  to  the  limitation  of  available  resources. 
Such  an  assessment  may  only  be  accomplished  through  dynamic  analyses  which 
account  explicitly  for  the  interaction  of  all  system  elements  in  canbat.  Thus, 
conbat  simulation  becomes  the  principal  tool  of  analysis.  The  incorporation  of 
the  military  decision-maker  into  such  simulations  permits  to  also  make  the 
decision-making  process  object  of  exploratory  research  and  to  develop,  in  an 
evolutionary  manner,  decision  support  software.  This  is  illustrated  by  two 
examples.  One  shews  hew  cognitive  maps  of  military  corrmanders  may  be  retrieved 
by  means  of  interactive  conbat  simulations.  The  other  shows  an  approach  to  de¬ 
velop  tactical  decision  models  by  means  of  such  simulations.  Given  a  certain 
technical  capability,  such  research  might  be  carried  on  within  the  frairework 
of  military  exercises  as  proposed  in  the  Compound  Gaming  Approach, 
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V.  c3l:  AN  ELEMENT  OF  MILITARY  SYSTEMS 

Carmand,  Control,  Camtunication  and  Intelligence  (C^I)  has  been  defined  as 
"...  an  arrangement  of  personnel,  facilities  and  systems  for  information 
acquisition,  processing  and  dissemination  employed  by  a  (military)  decision- 
maker  in  planning,  directing  and  controlling  operations"  (see  [1],p.  42). 

Thus,  even  though  it  is  not  a  readily  separable  entity  such  as  a  weapon 
system  or  a  combat  service  support  system,  the  C3I  system  may  be  considered 
as  one  of  the  three  fundamental  physical  elements  of  military  systems  compe¬ 
ting  for  resources.  For  the  force  planner  the  question  is  hew  to  distribute 
the  available  human  and  financial  resources  among  these  elements  so  that, 
in  the  prevailing  threat  environment,  there  is  a  high  probability  that  the 
military  missions  essential  to  meeting  the  national  objectives  can  be  accom¬ 
plished.  Since  the  answer  also  depends  on  the  operational  philosophies  and 
concepts  of  the  respective  military  forces,  we  may,  in  a  sen; what  simplified 
nanner,  conceptualize  the  overall  force  planning  problem  in  the  form  of  a  system 
of  interacting  physical  and  conceptual  elements  as  depicted  in  Fig.  1. 


THREAT  ENVIRONMENT 


Fig.  1:  Fundamental  Elements  of  a  Military  Defence  System 
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The  C3I  system  interacts  with  all  the  other  force  elerents,  not  the 
least  with  the  conceptual  element  of  tactics  and  doctrine.  For  exanple,  a 
military  system  employing  the  Auftragstaktik  practiced  in  the  German  Amed 
Forces  may  very  likely  require  canparatively  little  C3.  Father  than  giving 
more  or  less  detailed  orders  to  the  unit  entrusted  with  a  mission,  according 
to  Auftragstaktik  merely  the  objectives  are  stated, while  it  is  left  to  that 
unit's  ccnmander  to  determine  hew  to  accomplish  the  objectives.  Thus,  it  is 
proposed  that,  when  designing  a  military  force  and  its  C3I  system,  also  tac¬ 
tics  and  doctrine  need  to  be  reviewed  and  adapted  to  maximize  the  probability 
of  mission  accomplishment  within  the  constraints  imposed  by  the  available 
resources. 

2.  THE  PROBLEM  OF  PIECEMEAL  ANALYSIS 

It  appears  that  reality  of  today' s  force  planning  hardly  accounts  fear 
these  interactions.  Rather  it  assumes  the  conceptual  elanents  as  given  para¬ 
meters  and  considers  each  of  the  physical  elenents  more  or  less  in  isolation. 
For  exanple,  force  design  analyses  are  mostly  restricted  to  "optimizing*1  the 
weapons  mix  of  military  defence  systems  tacitly  assuming  the  existence  of 
support  systems  and  C3I  capabilities  that  permit  the  employment  of  the  wear- 
pen  systens  at  their  specified  (maximum)  performance  level ^  ^ .  Tactics ,  doc¬ 
trine  and  operational  concepts  are  usually  not  analysis  subjects  but  are 
considered  invariant  parameters  (see  Canby  [4]).  The  a  priori  specified 
missions  and  objectives  are  assumed  to  be  efficient  in  terms  of  the  overall 
(strategic)  defence  mission.  Therefore,  it  is  likely  that,  as  a  conse¬ 
quence  of  those  analyses,  the  operational  capabilities  of  our  present  military 
-forces  and  the  ccirbat  capabilities  of  many  of  today's  sophisticated  weapon 


^  Typically,  in  thQse  studies  the  objectives  are  defined  in  tents  of  an 
array  of  targets  which  have  to  be  neutralized  within  a  specified  time. 

As  long  as  they  survive,  the  weapon  system  are  assumed  to  operate  con¬ 
tinuously  at  their  technical  and  operational  performance  specification  im¬ 
plying  unlimited  availability  of  targets,  connunication ,  POL  and  spares 
and  assuming  service  support  systems  being  attrited  uniformly  and  at  the 
same  rates  as  the  respective  weapon  system  they  support  (see  e.g.  Huber  and 
Neubecker  [3]) . 
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systems  represent  merely  theoretical  maxima.  In  reality,  inadequate  service 
support  systems,  less  than  perfect  C3I  systems,  and  not  quite  appropriate 

2) 

tactics  must  be  expected  to  degrade  these  capabilities  perhaps  significantly  . 

It  is  true  that,  in  the  past  two  decades,  significant  amounts  of  resources 

have  been  spent  on  inproving  the  C3I  systems  (Teates  et  al.  estimate  about 

9  billion  dollars  have  been  spent  up  to  1980  in  the  OS  alone  [1],p.40) ,  but 

these  efforts  have  not  got  anywhere  near  a  situation  that  would  justify 

3 

the  force  designers1  highly  optimistic  assumptions  regarding  the  C  I  capa¬ 
bilities.  On  the  contrary,  sane  experts  even  believe  that,  by  having  con¬ 
sidered  C3I  in  isolation  and  maximized  the  use  of  modem  technology  for 
its  own  sake,  we  may  have  enBed  up  with  systons  that  will  not  work  in  war 
(Cushman  [5],  p.  46). 

A  bureaucratic  institutionalization  of  the  action-oriented  cost-effective¬ 
ness  approach  within  the  PPB  management  framework  may  have  contributed, 
perhaps  significantly,  to  such  a  situation  brought  about  by  piecemeal 
thinking  in  defence  planning  and  analysis  (see  also  [6,7]) .  In  the  plan¬ 
ning  phase  of  the  PPB  process,  national  goals  are,  through  a  top-down  ends- 
means  analysis,  decomposed  into  elementary  military  tasks.  For  each  of  these  tasks, 
alternatives  to  their  accomplishment  are  determined  in  the  programming  phase. 

Based  on  a  cost-effectiveness  assessment  of  these  alternatives  the  responsible 
planners  compile  program  proposals  for  eventual  implementation  in  the  budgeting 
cycle. 


However,  due  to  the  impact  of  modem  technology  military  defence  systems  tend  to 
become  ever  more  canplex;  i.e. ,  their  elements  are  increasingly  interrelated.  They 
also  represent  open  systems  with  a  highly  dynamic  environment.  Therefore,  the 
decomposition  of  overall  goals  into  subgoals  and  elementary  tasks  will  hardly 
lead  to  set  of  truly  disjunctive  elements  so  that  the  "optimum"  (most  cost- 
effective)  solutions  determined  separately  for  each  of  these  tasks  may,  at 
best,  be  considered  only  as  initial  solutions  that  need  to  be  iteratively 
tested  for  their  mutual  and  environmental  canpatability  in  the  light  of  re¬ 
source  constraints  and  enemy  reactions.  However,  since  military  planners  usually 


2)  It  is  for  this  reason  that  Teates  et  al.  reject  the  view  of  C  1  systems  being 
considered  "Force  Multipliers"  capable  to  somehow  increase  the  etrecti 
or  even  the  numbers  of  a  fixed  set  of  weaponry.  Because  of  the  insepar^^ 
relationship  between  the  C3I  system  and  the  warfighting  assets,  the  torne 
is  perhaps  more  of  a  "Force  Divider"  or  rather  a  "Force  Degrader  (see  F2J,p. 
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operate  in  an  organizational  environment  characterized  by  rather  rigidly 
compartmentalized  hierarchical  bureaucracies ,  the  feedback  necessary 
for  an  iterative  testing  and  adaption  of  solutions  hardly  ever  exists.  But, 
without  it,  the  holistic  idea  that,  being  systems  analysts,  the  authors  of 
the  PPB  approach  undoubtedly  attenpted  to  implement  in  public  planning, 
becomes  perverted  through  the  bureaucratic  institutionalization  of  piece¬ 
meal  thinking  on  the  level  of  elementary  tasks  (i.e.,  the  whole  is  the  sum 
of  its  parts!) .  It  also  contributes  to  the  planner's  preoccupation  with  the 
future  because  the  isolated  problem  solving  makes  him  forget  that  his  solutions 
may  only  be  implemented  in  an  evolutionary  manner,  i.e.,  they  must  be  com¬ 
patible  with  the  present  military  force  and  its  forseeable  overall  evolution. 

As  a  consequence,  force  planning  in  most  NATO  nations  seems  to  have  largely 
degenerated  into  adjusting  program  proposals  (independently  arrived  at  by 
the  services  and  arms  branches)  to  the  available  resources.  Unless  political 
considerations  dictate  otherwise,  resource  constraints  are  usually  effected 
through  cuts  across  all  of  the  elementary  tasks  with  a  bias  toward  the 
more  visible  weapon  systems  rather  than  support  systems  or  even  C^I . 

There  is  no  doubt  in  my  mind  that,  by  attempting  to  be  responsive  to  the  daily 
needs  of  the  military  planners,  the  systems  analysis  oonrnunity  has  contri¬ 
buted  to  this  situation.  As  early  as  1968  Schlesinger  has  postulated  that  we 
must  get  away  from  traditional  systems  analysis  "..in  the  sense  of  analysis 
to  assist  in  a  simple  choice  between  several  given  alternatives  for  accom¬ 
plishing  a  single  objective  or  task"  ([8], p.387) .  An  analysis  approach  is 
needed  that  accounts  for  the  complementarities  among  the  military  missions 
and  tasks,  permits  trade-offs  among  the  conceptual  and  physical  elements  of 
the  military  forces,  and  explicetly  considers  the  opponent's  presumed  reactions 
to  the  operational  and  structural  planning  options. 

That  the  approaches  of  what  Schlesinger  called  traditional  systems  analysis 
are  still  very  much  alive  in  military  planning  is  obvious  from  the  rather  wide 
use  of  multilinear  functions  for  the  assessment  of  the  effectiveness  or  utility 
of  systems.  A  typical  example  can  be  found  in  a  1980  publication  on  a  study 
to  define  the  requirements  for  a  new  combat  aircraft  for  the  German  Air 
Force  [7,91.  There,  the  GAF  mission  was  defined  in  terms  of  the  number  of 
j-type  targets  (j-1,...m)  that  have  to  be  neutralized  within  a  specified  time 
period  in  a  conflict.  With  x i  denoting  the  number  of  weapon  systems  of  type  i 
and  or  j  the  number  of  j-type  targets  that  an  i-type  system  can  be  expected  to 
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neutralize  within  the  specified  time,  the  mission  was  defined  by  m  linear 
equations 


xij 


Vj 


(1) 


where  x^  is  the  number  of  i-type  weapon  systems  allocated  to  j-type  tar¬ 
gets.  Using  an  LP,  the  (maximum)  mission  capability  of  the  existing  systems 
i  =  1 , . . .  ,k  was  determined  fran 


m  k 

P  =  E  2  a  .x  .  =  Max  (2) 

c  j=i  i=i 

such  that  Eq.  (1)  and  the  conditions 


m 

E  x. ,  <  x.  Vi,  (3) 

j=i  3 


were  satisfied.  It  turned  out  that 


P 


anax 


1  m 

E  P.  <  E  P. 

j-i  3  j-i  3 


(4) 


i.e.,  the  maximum  potential  of  the  existing  force  was  only  sufficient  to  cover 
the  targets  of  types  j=1,...l(l  <  m)  leaving  a  deficit  of 


m 

PD=  S  ** 

D  j=l+1  3 


(5) 


as  the  mission  for  which  a  new  system  had  to  be  specified.  From  the  avail¬ 
able  alternatives  i  -  k-H,...,n  the  one  with  the  lowest  life-cycle  cost  C 


m 


Ci=  xici(Xi/x1#...Xk)  =  Min,  xi  =  E 


j-1+1 


Cij 


(6) 


satisfying  Eq,  (5)  was  selected  as  the  candidate  alternative  for  the  definition 
of  the  requirements.  The  creative  contribution  of  systems  analysis  was  more 
or  less  restricted  to  the  piecemeal  determination,  target  by  target  and 
system  by  system,  of  the  target  neutralization  capability  . 


Another  exairple  for  the  use  of  multilinear  functions  in  weapon  systems 
planning  can  be  found  in  a  recent  publication  an  the  model  employed  by  the 
Genran  Navy  for  the  assessment  of  the  new  frigate  122  (see  [10],  p.200).  But 
also  analysis  engaged  in  net-assessment  uses  such  linear  functions. 

An  interesting  example  for  this  is  Lucas  Fischer's  study  of  1976  on  the  conven¬ 
tional  balance  of  forces  in  Central  Europe  [11],  He  measures  the  balance 
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by  the  quotient  of  two  multilinear  functions  each  expressing  capabilities  which 
the  Warsau  Pact  and  NATO  are  able  to  deploy  as  a  function  of  time  after  the 
pact  started  mobilization: 


*  _  Vi  t-WlV-Vn 

*  B.V.  + _ +B.v.+*..B 

m 


Vi  +"-+ejy5+-”Biny, 


(7) 


where  denotes  the  Pact  and  the  NATO  inputs  (such  as  soldiers  and 
weapon  systems) ,  and  3j  the  marginal  capabilities  of  the  respective 
inputs. 


3,  THE  NEED  FOR  DYNAMIC  ANALYSES  THROUGH  COMBAT  SIMULATION 
In  order  to  illustrate  the  type  of  analysis  required  in  military  planning, 
Huber  17  #12]  has  interpreted  military  defence  as  a  production  process.  In 
analogy  to  economic  theory  he  proposed  an  extended  production  function 


* 


P 


9  *  *  • 


Vyi 


(8) 


•k 

in  which  P  denotes  the  defence  product  in  terms  of  a  net  capability  of  the 
opponents  X  and  Y,  xi  and  their  respective  inputs.  In  fact,  Eq.  (7)  is 
but  a  special  functional  form  of  such  an  extended  production  function. 

When  each  side  X  and  Y  has  but  one  type  of  input  (homogeneous  force  structure) , 
equation  (7)  reduces  to 


p*  _  ax  (9) 

P  By 

describing  the  victory  conditions  for  Lanches ter- type  battles  between  X  and  Y, 
Lanchester  [13]  hypothesized  that  under  certain  conditions  attrition  in  cam- 
bat  between  two  homogeneous  forces  is,  for  each  side,  proportional  to  the 
numerical  strength  of  the  eneny.  Thus,  we  may  write: 


} 


-ax 


(10) 


with  x  and  y  denoting  the  instantaneous  numbers  of  live  units  of  sides  X 
and  Y,  and  a  and  b  the  so-called  attrition- rate  coefficients.  Eliminating  dt 
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from  Eq.  (10)  and  solving  for  the  initial  conditions  x  =  xQ  and  y  =  yQ,  we 
obtain 


a(x2  -  x2)  =  b(y2  -  y2) . 


(11) 


If  we  consider,  e.g.,  X  the  winner,  if  y  =  0  and  x  >  0,  we  readily  deduce  from 
(11)  that 


or 


K  = 


x  /a*" 
o 

yQ 


>  i. 


(12) 


With  a  =  /a”  and  3  =  Zb"  Eq*  (9)  and  Eq.  (12)  are  identical. 

This  result  shows  that  the  military  planner's  production  function  is  essen¬ 
tially  a  solution  to  a  mathematical  model  of  battle  or  war.  Hcwever ,  for  actual 
planning  purposes  Eq,  (10)  is  hardly  a  sufficiently  realistic  model.  To  this 
end,  the  rtodel  must  be  considerably  enriched  (e.g., accounting  for  hetereogeneous 
forces  augmented  by  combat  support  elements  and  under  conditions  of  variable 
attrition-rate  coefficients)  in  which  case  analytical  solutions  are  rather 
impossible.  The  only  feasible  approach  to  eventually  specify  military  produc¬ 
tion  functions  is  the  experimental  one  through  battle  simulations. 

Attempts  to  derive  such  functions  frcm  records  of  historical  conflicts  and/or 
personal  cartoat  experience  are  usually  frustrated  by  the  randomness  of  canbat 
circumstances  and  by  insufficient  records  and  recollections.  But  even  if  we 
had  perfect  records,  the  usefulness  of  production  functions  derived  frcm  then 
would  be  rather  questionable. 


This  is  because  we  can  not  be  sure  whether  the  historical  processes  represent 
efficient  processes  in  the  sense  that  both  sides  employed  their  forces  in 
a  mutually  "optimal"  manner.  The  implicit  efficiency  assumption  of  economic 
production  theory  (that  the  production  processes  from  which  the  production 
function  and  its  coefficients  are  determined)  is  only  justified  as  long  as  a 
oonpeting  market  eliminates  inefficient  producers.  Hcwever  such  an  a  priori 
assumption  is  highly  questionable  in  the  defence  field  because  (1)  defence 
organizations  ireet  few  of  the  characteristics  of  competitive  firms;  (2)  their 
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ultimate  objective  is  nothing  less  than  trying  to  prevent  the  "narket"  from 
happening.  Thus,  an  explicit  analysis  of  the  processes  by  which  military  inputs 
are  converted  into  outputs,  that  is  an  examination  and  eventual  adaption  of 
the  operational  principles  and  doctrines  controlling  the  military  production 
processes,  is  an  indispensable  task  within  military  force  planning.  Techno¬ 
logical  change  makes  this  an  even  more  urgent  requirement,  even  if  combat  ex¬ 
perience  is  available.  Christopher  Harvie's  essay  on  "Technological  Change  and 
Military  Power  in  Historical  Perspective”  [14]  presents  ample  historical  evi¬ 
dence  on  both,  hew  the  adaption  of  operational  principles  to  new  technology 
provided  the  decisive  edge,  and  hew  a  retrospective  military  ideology  stood 
in  the  way  of  innovation^  . 

Steven  Canty1  s  1973  criticism  of  the  systems  analysts  around  Alain  Enthoven 
mainly  concerns  the  efficiency  assumption  implied  in  their  analyses.  They 
accepted  the  philosophies,  concepts,  and  operational  principles  underlying 
military  organizations  as  given  rather  than  treking  them  subject  of  the  ana¬ 
lysis.  That  Enthoven  considered  them  to  be  efficient  is  obvious  from  his  argument: 

"Army  force  planners  must  be  satisfied  with  the  current  force  structure  because 
they  have  not  proposed  changes  when  invited  to  do  so"  (see  [4  ]  ,p.9) . 

In  the  context  of  long  range  armaments  planning  Huber  [15]  discusses  an  example 
which  demonstrates'  the  impact  of  operational  concepts  on  capabilities  and  struc¬ 
tures  of  two  opposing  tactical  air  forces.  Based  on  the  assumption  that  tactical 
air  operations  are  performed  in  direct  and/or  indirect  support  of  the  land  battle, 
he  considers  tactical  air  we r  as  a  multistage  game  where  the  adversaries  decide, 
at  each  stage,  how  to  allocate  their  tactical  aircraft  to  the  basic  tactical  air 
missions  of  Offensive  Counter  Air  (OCA)  and  Offensive  Air  Support  (OAS) ,  so  that 
their  respective  capabilities  to  support  the  land  battle  become  a  maximum  re¬ 
lative  to  that  of  the  enemy.  With  respect  to  force  structure  planning,  both  of 
the  two  fictitious  antagonists  are  considered  to  be  constrained  by  constant  budgets. 


The  German  Blitzkrieg  strategy  of  World  War  II,  combining  a  mass  tank  offen¬ 
sive  with  infantry  and  air  support,  is  the  classical  example  for  both.  It  had 
been  anticipated  by  Puller  and  Lidell  Hart,  but  it  was  dismissed  by  the 
traditionalists  in  the  British  Army.  But  in  Weimar  Germany,  the  hard  restric¬ 
tions  of  the  treaty  of  Versailles  enforced  a  drastic  break  with  the  past.  It 
encouraged  the  development  of  new  options  which  became  evident  in  the  early 
canpaigns  of  WW  II.  Hcwever,  the  operational  and  technical  innovations  of 
the  Germans  in  the  land/air  war  domain  were  not  matched  in  naval  warfare. 

In  fact,  their  naval  construction  plans  were  quite  old-fashioned,  an  indi¬ 
cation  that  the  navy's  operational  thinking  essentially  resumed  where  it 
had  ceased  after  World  War  I. 
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Tab.  1  shows  results  of  four  games  in  terms  of  the  relative  QAS-capabi lities 

4) 

of  defender  (V)  versus  attacker  (A)  .  The  underlying  force  structures  are 

characterized  by  heavy  (H)  and  medium  (M)  tactical  aircraft  on  which  both 
sides  spend  half  of  their  available  budgest  each.  A  is  assumed  to  have  30% 
more  aircraft,  V  to  have  aircraft  of  somewhat  better  performance  with  res¬ 
pect  to  payload  and  weapon  effects  as  well  as  sortie  capability. 


attacker  A 

no  OCA 

opt,  OCA 

>  no  OCA 

0.86 

0.99 

$ 

V  opt. OCA 

2.87 

1.0 

Tab.  1 :  Relative  OAS -capabilities  of  V  versus  A 

The  values  in  Tab.  1  are  normalized  around  the  case  where  both  sides  pursue  their 
optimum  policies  in  terms  of  mission  allocations  (opt.  OCA).  Thus,  all  values 
must  be  interpreted  in  relation  to  the  "balance"  of  that  case.  Whatever  actual 
OAS- capability  ratio  rrey  have  resulted  there,  the  values  indicate  that  it  reduces 
to  86%  if  both  sides  allocate  their  aircraft  to  OAS  only.  The  values  also  shew 
that  the  attacker  A  is  practically  forced  to  open  the  campaign  with  an  OCA  operation 
If  A  leaves  the  OCA  to  V,  then  the  OAS-potential  ratio  increases  by  a  significant 
factor  (2.87) .  This  result  is  due  to  the  superior  OCA  performance  characteristics 
of  V's  systems  which  A  must  prevent  from  becoming  effective.  This  is  true  in 
spite  of  the  low  OCA  effectivenss  of  A's  systems  which  do  not  significantly  reduce 
the  OAS  ratio  (fran  1  to  0.99)  when  V  does  not  react  to  A’s  OCA. 


The  terms  attr^cv  and  defer.'1  ev  do  not  imply  that  the  respective  antagonist 
is  limited  to  offensive  or  defensive  operations.  They  only  indicate  that  the 
attacker  initiates  the  hostilities,  i.e.,gets  the  first  move,  to  which  the 
defender  reacts.  Both  have  perfect  intelligence. 
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This  example  illustrates  that,  for  given  force  structures  on  both  sides,  the 
balance  may  vary  significantly  depending  on  the  operational  concepts  chosen  by 
each  side.  This  is  to  say  that  the  efficiency  of  military  production  processes 

is  not  only  depending  on  one’s  cwn  operational  principles  and  doctrines,  it  is 
equally  sensitive  to  those  of  the  potential  enemy.  Quite  similar  findings  were 
recently  presented  by  Farrell  [16]  who  has  evaluated  a  large  number  of  simu¬ 
lated  land  combat  histories.  He  showed  that,  in  many  instances,  tactics 
and  doctrine  have  a  much  more  decisive  influence  on  ooirbat  outcome  than  weapon 
system  performance  parameters. 

The  triplications  of  these  findings  for  the  systems  analysis  supporting  the 
planning  of  military  systems  are  obvious:  Military  systems  and  their  elements 
nay  be  adequately  assessed  only  through  dynamic  analyses  employing,  in  an  experi¬ 
mental  fashion,  gaming  models5^ ,  which  "act  out"  combat  and  provide  the  infor¬ 
mation  necessary  to  trade-off  resources  ( men, sys terns ) ,  structure, doctrine  and 
tactics  explicitly  accounting  for  the  same  factors  on  the  potential  opponent’s  side. 
Being  the  art  of  srploying  ccmbat  resources,  tactics  and  doctrine  are  implemented 

through  command  decisions  requiring  some  kind  of  C3I  capability.  Therefore,  a  more 

3 

or  less  explicit  representation  of  C  I  in  combat  simulations  is  prerequisite  to 
(1)  a  more  realistic  assessment  of  the  capabilities  of  the  existing  forces;  (2) 
the  evolution  of  tactics  and  doctrine  such  that  the  inherent  force  capabilities  can 
be  fully  exploited;  and  (3)  the  "balanced"  design  of  military  systems  in  general 
and  of  C^I  system  in  particular. 


The  Term  "gaming"  is  used  to  characterize  two-sided  battle  models  in  which 
the  opponents  react  to  each  others  actions  either  through  interaction  of 
human  carmanders  or  through  a  formalized  contingency  logic  or  through  simple 
decision  rules.  Such  games  ccnprise  the  entire  range  from  military  exercises 
and  combat  experiments  employing  men  and  equipment  to  the  highly  abstract 
analytic  games .  The  above  discussed  air  war  game  is  an  example  of  an  analytic 
game  using  simple  decision  rules.  It  describes  the  states  xu  and  yU  of  the 
opposing  air  forces  at  time  step  u  by  two  state  equations 


x(u+1) 


y(U+D 


where  a  and  B  denote  the  operational  strategies  of  the  adversaries.  The  game 
theoretic  decision  rule  is  given  by 


a 


nan 
e  E, 


B. 


max 
e  E 


U 


D 


where 

sive 


•  U  =  (<i#  /Ir  (a,  3  )  is  the  utility  function  with  I_  and  I, 

air  support  potential^  of  defender  and  attacker  respectively. 


the  offen- 
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4.  INTERACTIVE  SIMULATION  FOR  C3I  RESEARCH 

It  seems  to  be  generally  true  that  the  acceptance  of  models  of  socic^ technical 
systems  tends  to  decrease  as  the  degree  to  which  human  factors  influence  the 
system  dynamics  increases  ( see, e. g. , Schultz  and  Slevin  [17]).  The  evidence  avail¬ 
able  from  military  OR/SA  certainly  attests  to  that.  With  air  combat  being  in¬ 
fluenced  to  a  much  larger  extent  by  physical  and  engineering  than  by  human 
processes,  at  least  when  compared  to  land  combat,  air  war  models  have  become 
accepted  earlier  and  to  a  much  higher  degree  than  land  war  models^  . 

But  a  similar  pattern  can  be  recognized  within  land  combat  modelling  if  we 
ackcwledge  that  the  acceptance  of  models  is  closely  correlated  to  their 
state-of-the  art.  Of  the  six  combat  processes  usually  distinguished  in  the 
literature  (see,e.g.  ,Lcw  [18]),  modelling  of  attrition  and,  to  a  somewhat 
lesser  degree,  of  movement  is  much  more  advanced  than  modelling  of  suppression, 
zcmbaz  support ,  combat  service  support ,  and  of  C'l  (see  Huber  [19]).  With  re¬ 
gard  to  the  latter,  last  year's  NATO- symposium  on  "Modelling  and  Analysis  of 
Defence  Processes"  concluded,  that  "  . .  for  those  processes  that  are  well 
understood  in  the  sense  of  physics  and  engineering,  there  are  quite  adequate 
models  available.  These  include  ccnmunications ,  the  electronic  effects  of  ECM 
and  decoys,  collection  system  performance,  and  computer  processing.  The  weakest 
link  in  modelling  C3I  processes  and  systems  is  the  human  element.  Not  much  is 
known  about  the  higher  order  cognitive  functions  and  the  population  of  decision 
makers  who  implement  tactics  and  doctrine  and  respond  to  intelligence  and  ECM" 
([19] ,p.15) . 

In  a  historical  perspective,  this  situation  seems  not  surprising.  Because  it 
was  the  very  ignorance  of  how  tactical  decision  makers  operate  in  a  more 
complex  combat  environment  that  made  the  systems  analysts  discover  the  rather 
old  military  tool  of  interactive  gaming"^  .  By  the  incorporation  of  a  human 


6)  The  history  of  military  OR/SA  in  Germany  is  proof  of  that.  It  started  in 
1962  when  air  force  and  navy  initiated  the  operations  of  analysis  groups 
to  be  followed  by  the  army  only  about  half  a  decade  later.  Also,  while 
air  force  and  navy  emphasized  combat  modelling  right  from  the  start,  most 
of  the  initial  army  studies  were  related  to  logistics. 

^  ^  In  most  closed  combat  simulation  models  and  in  the  •  analytic  games  the 
combat  environment  is  rather  simple.  A  typical  example  is  the  above 
discussed  analytical  air  war  game.  There,  the  opposing  commanders  only 
decide  on  the  proportion  of  their  forces  allocated  to  Offensive  Counter 
Air  and  to  Offensive  Air  Support.  The  mcdel  assumes  that,  in  doing  so, 
they  have  perfect  information  on  the  instantaneous  states  and  the  histories 
of  their  own  and  the  enemy's  resources.  Also,  practically  the  entire  c  I 
system  is  represented  by  the  min-max  decision  rule. 
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garner  to  represent  the  tactical  decision  maker,  the  problem  of  inadequate  de¬ 
cision  models  was  circumvented.  But, by  doing  this,  C3I  becaire  an  inplicit  ele¬ 
ment  of  the  military  system  models  that  did  not  readily  lend  itself  to  being 
traded-off  versus  other  elements.  In  interactive  combat  simulations  gaiters 
and  game  controllers  more  or  less  "simulate"  the  C3I  systems. 

The  development  of  formal  models  of  C  I  systems  was  part  of  irore  or  less 
isolated  assessments  that  were  mostly  restricted  to  the  two  lower  system  levels 
defined  by  Alberts  [20],  that  of  technical  system  performance*'*  and  that  of 
information  attributes  ^  (see  Fig.  2) .  If  performed  at  all,  assessments  on  the 
third  level  of  information  value  usually  assumed,  in  true  piecemeal 
fashion,  that  the  information  value  increases  monotonically  as  the  in¬ 
formation  attributes  improve  (motto:  more  and  faster  is  better  than  less 
and  slower) . 


Tig.  2:  Task  Specific  Measurement  Levels  of  C 3  I  Systems  ( Alberts  1980) 


There  have  been  several  proposals  on  how  to  ireasure  the  information  value  of 
3 

C  I  systems  ( see, e.g„, Cushman  [5],  Alberts  [20],  Miller  [21],  Huber  and 


E.g.,  carmunications  speed,  memory  size,  access  time,  instruction  complexity , 
and  I/O  characteristics. 

Timeliness,  currency,  accuracy,  ccmpleteness ,  and  ease  of  use. 
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Hofmann  [22]) ,  They  all  agree  that  such  measures  must  somehow  express  the  con¬ 
sequences  as  to  the  expected  course  and  outcome  of  the  operations  which  the 
respective  C^I  systems  are  serving.  In  the  context  of  battlefield  C  I,  this 
would  be  best  accomplished  through  battle  simulation  ^ . 

But  since  data  become  information  only  after  being  processed  in  the  re¬ 
cipient's  mind,  such  simulation  experiments  need  to  be  interactive,  i,e., 
they  must  include  the  tactical  decision  maker.  Interactive  simulation  would 
also  permit  to  make  the  cognitive  processes  of  military  commanders  object  of 
exploratory  research  thus  helping  to  close  the  above  indicated  gap  of  know¬ 
ledge  on  the  human  element  in  C3I  systems.  And  last  but  not  least,  interactive 
simulation  is  a  conditio  sine  qua  non  for  the  evolutionary  development 
and  the  test  of  decision  support  systems  and  the  application  of  artificial 
intelligence  in  battlefield  C^I. 

Indeed  there  is  empirical  evidence  that  seems  to  underline  the  desirability 
of  some  decision  support  capability  for  the  field  carmander.  Fran  series  of  69 
interactive  combat  games  involving  23  sets  of  players  Daniel  [23]  arrives,  among 
others,  at  the  conclusions  that  (1)  more  data  does  lead  to  better  quality 
decisions,  though  the  effect  is  small  compared  to  the  variations  in  results 
between  different  players;  (2)  prior  intelligence  (as  opposed  to  intelligence 
obtained  throughout  the  simulated  battles)  obviously  dominates  decisions;  (3) 
players  who  make  the  "best"  decisions  take  considerably  longer  than  average  to 
play  the  gaire  (in  fact,  Daniel's  slowest  player  took  twice  as  long  as  the 
fastest  player) ;  and  (4)  the  "poorer"  players  do  lot  better  with  more  data  then  the 

"better"  players  do  with  less  data. 

According  to  Daniel,  the  question  as  to  whether  the  rather  small  Impact  that  data 
levels  (of  in-battle  intelligence)  had  on  the  quality  of  player  decisions  is 
symptomatic  of  player's  inability  to  make  use  of  the  data,  or  merely  reflects 
the  fact  that  high  data  levels  are  perhaps  superfluous,  is  yet  to  be  answered. 

But  either  way,  we  might  oonclude  that  merely  providing  more  and  more  current 

3 

data  may,  for  battlefield  C  I,  yield  only  disappointing  returns.  Rather,  com¬ 
manders  should  be  given  same  data  "processing"  capability  thus  providing  them 
effectively  more  time  for  their  decisions. 

This  suggests  that,  contrary  to  the  hitherto  practiced  philosophy 


We  concur  with  Cushman1  s  observation  that  " . .  a  program  of  battle  simulation 
will  also  foster  incremental,  evolutionary  growth  of  C  systems,  through 
a  process  of  systematic  trial  and  modification,  in  the  absence  of  conflict 
([5] ,p.47) . 
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of  designing  highly  centralized  C^I  systems  in  the  fashion  of  the 
classical  management  information  systems,  we  perhaps  ought  to  pursue 
a  highly  decentralized  architectural  approach  with  sane  AI  capability. 

Such  a  philosophy  would  also  permit  pursuit  of  the  evolutionary  growth 
of  C3I  systems  postulated  by  Cushman  because  the  necessary  battle  simu¬ 
lations  could  be  largely  performed  as  part  of  routine  training  exer¬ 
cises  using  the  processors  of  such  a  decentralized  C2I  system. 

Indeed,  the  so-called  oommand  apd  staff  simulators  (CSS)  proposed  by 
Huber  [7,12,24]  as  part  of  a  comprehensive  systems  analysis  approach  in 
support  of  force  planning11^  could  be  gradually  materialized  as  part  of  such 
a  C^I  evolution.  The  CCS  would  be  basically  designed  as  interactive  computer 
games  providing  military  staffs,  at  all  oarmand  levels,  a  dynamic  (acrobat)  en¬ 
vironment  for  their  work.  "In  addition...,  CSS-systems  would  also  permit  to 
better  assess  staff  performance.  They  would  provide  continuously  updated  in¬ 
formation  on  ccmrand  and  control  cycles  as  well  as  a  readily  available  testbed 
for  contend  and  control  systems"  (see  [12]  ,p.  107). 

As  of  today,  we12*  have  through  a  series  of  theses,  developed  the  basic 
software  package  for  a  battalion  /  brigade- level  CSS  to  demonstrate  the 
feasibility  of  such  systems  in  form  of  a  portable  prototype  simulator  to  be 
developed  within  a  research  program  (hopefully)  supported  by  the  German 
Amy.  In  addition  to  providing  a  training  tool,  this  simulator  is  conceived 
as  an  instrument  for  empirical  research  on  tactical  decision  processes  with  a  view 
to  the  development  of  decision  support  systems  in  C3I  (see  [25],p,4) . 

5.  TWO  EXAMPLES 

In  order  to  illustrate  the  type  of  problems  to  be  tackled  through  gaining 
experiments  by  means  of  interactive  ccmbat  simulation,  two  recently 


I. e.,  the  "Ccnpound  Gaming  Approach"  employing  hierarchically  ordered  fami¬ 
lies  of  interacting  combat  models  of  both,  the  formal  and  the  physical  kind, 
which  permit  addressing  future  issues  and  alternatives  in  the  light  of 
current  capabilities  and  deficiencies. 

12 ^ Together  with  Prof.  Hans  W.  Hofmann  at  the  Institute  of  Applied  Systems 
and  Operations  Research  of  the  German  Armed  Forces  University. 
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published  examples  shall  be  briefly  reviewed.  One  is  on  the  use  of 
gaining  to  develop  cognitive  maps  as  means  to  establish  the  way  in 
which  tactical  carmanders  model  their  (subjective)  decision  environ- 
nent.  The  other  describes  an  approach  to  deriving,  based  on  information 
obtained  from  interactive  gaming  experiments,  a  decision  model  for  tacti¬ 
cal  situations  in  which  a  multitude  of  criteria  have  to  be  considered. 

5.1  Cognitive  Mapping 

In  the  1982  paper  on  "Cognitive  Maps  of  Decision-Makers  in  a  Complex  Game" 

[26],  Klein  and  Cooper  report  on  a  series  of  manual  gaming  experiments 
in  which  a  number  of  players  acting  as  divisional  commanders  were  confronted 
with  two  scenarios  each,  a  defence  scenario  and  a  advance-to-contact  scenario. 
The  players  believed  to  be  part  of  a  team  (consisting  of  themselves  and  of 
one  superior  and  two  subordinate  commanders)  playing  interactively  against 
a  purposeful  enemy.  But#without  the  divisional  commanders  kncwing  this, 
the  enemy  and  divisional  players’  team-mates  were  played  by  the  game  control¬ 
lers  with  their  actions,  entirely  predetermined.  Thus,  the  players  could  be 
led  through  the  same  sequence  of  pre-planned  events  in  the  game,  so  that 
their  behaviour  could  be  compared  under  an  identical  sequence  of  objective 
circumstances.  During  the  course  of  the  game,  in  each  time  period,  the 
players  had  to  make  reports  to  their  superior  commanders  and  to  issue  direc¬ 
tions  and  orders  to  their  subordinate  commanders.  This  communication  was 
taped  and  from  the  transcripts  of  the  recordings,  cogn'  tivs  maps  were  derived 
for  each  player  . 

"A  cognitive  map  is  a  representation  of  the  perceptions  and  beliefs  of  an  indi¬ 
vidual  about  his  cwn  subjective  world"  ([261,p.63).  It  depicts  the  concepts 
used  by  the  individual  and  the  causal  relationships  between  them.  In  their 
experiments,  Klein  and  Cooper  only  considered  two  types  of  relationships, 
positive  and  negative.  A  positive  relationship  exists  when  a  change  in  the 
predecessor  concept  causes  a  similar  change  in  the  successor,  A  negative 
relationship  characteristics  a  case  where  an  increase  (decrease)  in  the  pre¬ 
decessor  causes  a  decrease  (increase)  in  the  successor.  As  an  example,  the 
player  statement  that  "  , ,  morale  is  high,  as  they  (members  of  a  friendly  unit) 
are  advancing  with  little  opposition..."  is  analyzed  to  exhibit  a  positive 
relationship  between  the  concepts  of  nhigh  morale "  and  "unopposed  advance ", 

As  an  example,  Fig.  3  shows  the  cognitive  maps  thus  derived  for  too  different 
players  in  the  advance-to-contact  scenario. 
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IB  ]  =  action  open  to  friendly  player  8/0  =  friendly  goals  / 

R..  *  action  open  to  enemy  /  N.,S.,P.  *  external  factors  /  U  s  friendly  utility 

Fig.  3:  Cognitive  Mops  in  Advance-  to  -  Contact  Scenario 
(  Klein  and  Cooper  [27)  ) 


Fran  such  maps  Klein  and  Cooper  noticed,  among  others,  a  rather  significant 
difference  in  the  nuirker  of  concepts  identified  by  different  players  and  in  the 
densities  of  their  cognitive  maps.  But,  for  the  majority  of  players,  the  number 
of  concepts  was  quite  similar  in  the  two  scenarios.  This  leads  them  to  con- 
elude  that  the  number  of  concepts  " . .  has  apparently  little  to  do  with  the 
objective  situation  and  my  present  seme  limit  to  the  quantity  of  concepts  that 
the  decision-maker  feels  he  can  usefully  cope  with  at  any  one  tine"  [26]  ,p.66)  . 


13) 


Number  of  observed  links  divided  by  the  maximum  number  of  possible  links. 
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But  the  fact  that  the  larger  maps  generally  exhibit  a  conparatively  lew 

density  seems  to  indicate  that  larger  maps  contain  more  peripheral  concepts 

of  limited  influence,  "..the  central  sections  of  different  players’  maps 

are  . . .  appearing  in  the  same  or  slightly  altered  form  in  several  players '  naps . " 

<[26],p.66). 

From  these  and  other  results  it  appears  that  cognitive  napping  should  be  a 
valuable  tool  for  the  structuring  of  knowledge  bases  in  tactical  expert 
systems  and  for  organizing  their  data  bases  through  a  series  of  properly  designed 
cembat  experiments.  Using  interactive  computer-simulation,  it  should  also 
become  possible  to  shed  some  light  on  the  largely  unresolved  issue  of 
decision  quality  as  a  function  of  map  size  and  density  and  on  the  inpact  of 
training  and  doctrine  with  regard  the  adequacy  of  decisions  in  given  scenarios. 

5.2  Decision  Modelling 

The  idea  of  using  interactive  caribat  simulation  to  develop  descriptive  decision 

models  for  incorporation  into  closed  combat’  games  has  been  proposed  by  Keidelhuber 

[28, 29] .  But  it  could  also  be  employed  for  the  development  of  decision  aids  in 
2 

tactical  C  systems. 

Reids lhuber  interprets  the  tactical  decision  problem  as  having  to  choose  the 
action  which  is  the  (most)  appropriate  in  a  given  decision  situation  des¬ 
cribed  by  a  data  vector 


*D  “  (13) 

of  m  criteria  that  may  be  thought  of  as  a  two  dimensional  profile  as  shewn  in 
Fig.  4a. 

In  order  to  find  A^,  Keidelhuber  assumes  that,  in  a  specific  decision  situation, 
there  is  a  known  set  of  possible  actions  A^(i=1 , . . .  ,n)  and  a  set  of  data  vectors 


{xn  -  •  • 


04) 


each  of  which  is  reference  profile  for  the  corresponding  (see  Fig.  4b) .  The 
decision  problem  thus  reduces  to  the  question  as  to  which  A  is  best  in 
situation  and  should  be  selected  as  AQ. 
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1,0 


Criterion 

value 


o-l - 1 - I - U 

*01  *02  *03  *04 

Fig.  4a.  Actual  profile 


*0m-l  *0m 

Criteria 


Fig.  4b.  Reference  profile* 


Rg.  4;  Profiles  in  a  Decision  Situation  (  Reidelhuber  1982  ) 


To  this  end,  Reidelhuber  proposes  the  decision  rule 

gP(V  rij  (7ij  (15) 

Where  rij  denotes  relevance  factor  taking  into  account  that  different 
criteria  may  have  different  weights  in  the  decision  considerations. 
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For  the  determination  of  the  reference  profiles  and  the  relevance  factors 
a  number  of  p  similar  decision  situations  (p  >>  n)  is  generated  in  course  of  an 
interactive  combat  simulation  to  find  p  data  vectors  thus  establishing  a  matrix 
X  of  primary  data. 


x11'”"x1j",-,x1m 

* 

* 

’Vl '  •  •  •  ,xkj '  •  ■  •  ,xkm 
* 

xpi'”"xpj'’*’'V 


(16) 


Furthermore,  of  the  n  actions  Ai  taken  by  the  players,  one  is  assigned  as  the 
best  to  each  of  the  p  situations,  i.e.,  to  each  row  in  the  matrix  X.  Then  all 
data  vectors  with  the  same  action  assigned  are  collected  in  disjunctive  classes. 
For  each  action  A. ,  the  reference  profile  X^  is  determined  as  the  vector  of  the 
mean  criteria  values  x  of  the  class  members 

qi  q.  qi 

X.  =t{2  XI,  (A. ) . i  X.  .(A. ),...,  £  x1  (A.)}  (17) 

1  %  1=1  1  i=l  J-7  1  1=1  “  1 

with  qi  as  the  number  of  members  assigned  to  class  i. 

The  relevance  factor  r,  .  is  defined  as  the  reverse  of  the  scaled  standard  devi- 
ation  s.  .  of  the  criteria  values  found  within  each  class  i.o. 


1 

~k~~  t 

Sij 


1 


m 


s.  .  +  2  s, 

ij  ro  il 


m 

2 


m 


2  (s.  +  ~  2  s., ) 

ig  m  1=1  il 


8ii  +  si 
2m  i 


— 1_  £  (x  -xli(A  )  )2]1/2 

V1  1=1  ^  i 


(18) 


(19) 


(20) 
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with  at  least  one  s  ^  o  in  each  class ,  The  idea  .of  this  definition  is  that  the 
relevance  of  a  criterion  for  the  decision  maker  is  the  smaller  the  more  strongly  the 
criterion  value  deviates  when  he  selects  the  sane  action.  The  transformation 
means  normalizing  s  and  shifting  the  zero  point,  because  no  relevant  criterion 
should  have  a  weight  that  is  too  small  or  too  large.  The  value  region  of  s* .  is 


2m 


irH-1 

2m 


Fig.  5  shews  Reidelhuber's  concept  for  the  development  and  adaption  of  the 
decision  model  to  new  weapon  systems,  to  alternative  engagement  tactics,  and 
to  modified  earned  and  control  doctrines.  It  starts  with  the  existing  de¬ 
cision  structure  and  ends  with  the  test  of  the  decision  nrxiel  in  a  "closed" 
simulation  (decision  model  replaces  player) .  In  case  the  results  are  not  acceptable, 
the  player  may  interactively  rrodify  the  decision  model. 


Fig.  S:  Concept  for  the  Development  of  the  Toctieai  Decision  Model 
(  Retdelhuber  1962  ) 


699 


6.  FINAL  REMARKS 

In  this  paper,  the  attempt  was  made  to  1)  demonstrate  the  necessity  for  the 
employment  of  interactive  combat  simulation  in  order  to  provide  the  infor¬ 
mation  for  a  truly  holistic  assessment  of  military  systems  in  general  and 
C3I  systems  in  particular;  2)  illustrate  how  interactive  gaming  could  assist 
in  developing  and  testing  models  of  cognitive  processes  in  C^I,  thus,  not  only 
helping  to  close  a  fundamental  modelling  gap,  but  also  aiding  the  evolution 
of  new  systems  with  some  AI  capability. 

To  this  end,  the  gradual  implementation  of  a  "compound  gaming  approach"  is 
suggested,  because  in  the  absence  of  military  conflicts  there  is  hardly  a 
viable  alternative  to  assure  an  adaptive  control  of  the  development  of 
military  forces  and  systems  in  a  mission  oriented  context  and  with  due  regard 
to  the  limited  availability  of  resources.  In  particular  with  a  view  to  a 
"balanced"  development  in  the  sense  of  a  robust  combination  of  weapon  systems 
mix,  force  structure,  tactics  and  operational  concepts,  means  must  be  avail¬ 
able  to  study  all  of  the  force  elements  in  an  interdependent  fashion  so  as 
to  show  the  possibilities  and  the  limits  of  their  mutual  substitutability. 

There  is  some  indication  that  piecemeal  thinking  and  analysis  has  led  us  to 
opt  rather  strongly  for  technology  resulting  in  systems  that  are  perhaps  not 
very  robust  and  rather  expensive  so  that  we  may  soon  no  longer  be  able  to 
afford  force  sizes  that  make  sense  operationally. 
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DECISION  ALGORITHMS  IN  FUZZY  SITUATIONS 


H.-J.  Zimmermann,  Institute  of  Technology 
Tempi  erg raben  55,  5100  Aachen  (FRG) 

ABSTRACT,  Classical  decision  models  and  algorithms  are  either  dichotomous  in 
character  (feasable  -  nonfeasable,  optimal  -  not  optimal)  or  they  are  sto¬ 
chastic.  There  are,  however,  many  decision  making  situations  which  are  ill- 
structured  or  vague  and  which  cannot  properly  be  modelled  by  use  of  classical 
tools.  Fuzzy  set  theory  has  been  put  forward  as  a  possible  bridge  between 
models  and  reality  in  above  mentioned  vaguely  described  situations.  In  the  mean 
time  more  than  4000  publications  are  available  in  the  area  of  fuzzy  set  theory 
and  its  applications.  Of  particular  interest  for  fuzzy  set  modelling  seems 
to  be  the  area  of  decision  making.  Algorithmic  approaches  such  as  fuzzy  linear 
programming  as  well  as  results  of  axiomatic  and  empirical  research  and  their 
application  to  civilian  and  military  problems  are  represented. 

1.  INTRODUCTION  Most  of  our  available  models  and  algorithms  are  "crisp",  i.e. 
based  on  traditional  mathematics  or  dual  logic  which  are  both  dichotomous  in 
character.  This  is  certainly  appropriate  if  the  problem  under  consideration 
is  of  the  yes-cr-no  type  as  frequently  encountered  in  the  physical  sciences, 
in  engineering  or  in  hardware  design.  Here  we  can  clearly  distinguish  between 
optimal  and  nonoptimal,  feasable  and  nonfeasable  solutions  etc. 

Decision  making  normally  involves  human  judgments,  evaluations  and  percep¬ 
tions.  Their  structure  is  not  dichotomous  but  rather  vague,  not  of  the  yes- 
no  but  of  the  more-or-less  type.  If  an  essentially  two-values  modelling  langu¬ 
age  is  used  to  model  this  type  of  problem  then  in-appropriate  models  may 
result  and  model-solutions  do  not  coincide  with  problem  solutions. 

There  are  different  types  of  vagueness  which  have  to  be  taken  into  account; 
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1*  Vagueness  concerning  the  occurrence  of  crisply  described  events.  This  is 
and  has  been  the  domain  of  probability  theory.  Typical  statements  of  that  type 
are:  "The  probability  of  hitting  the  target  is  .6"  or  "There  is  a  good  chance 
of  meeting  hirn" . 

1 1 .  Vague  Phenomina 

a ,  )  I rit r i  ns i  c_  V aguones s 

This  i's  a  type  of  vagueness  which  is  due  to  the  vagueness  of  human  judg¬ 
ments  and  concepts.  Examples  are  terms  such  as  "tall  men",  "acceptable 
profits",  "high  vulnerability",  "long  sticks"  etc. 

b .  )  In formati  onal  Va g u e ne s_s 

Wo  arc  accustomed  to  the  view  that  a  lack  of  information  causes  vague¬ 
ness  (for  instance  in  stochastic  statements).  There  is,  however,  also  a 
type  of  vagueness  which  is  due  to  an  abundance  of  information.  A  "credit¬ 
worthy"  person  could,  for  instance,  be  fully  described  by  using  a  large 
number  of  "descri ptors" .  Since  the  human  capacity  for  information  proces¬ 
sing  and  storage  is  very  limited,  not  all  the  necessary  descriptors  will 
be  in  the  mind  of  a  person  when  using  the  term  creditworthiness.  Human 
beings  can  still  communicate  using  these  terms  which  are  generally  called 
"subjective  categories".  But  when  doing  so  the  set  of  "creditworthy  men" 
is  no  longer  a  set  in  tiie c I assi ca  1  sense  but  rather  a  category  with 
vague  bounder les.  Decomposing  such  terms  normally  makes  the  subcategories 
better  and  sharper  defined  because  fewer  descriptors  are  needed  to  des¬ 
cribe  them  sufficiently  and  men  can  be  aware  of  a  larger  fraction  of  these 
descriptors  at  the  time  when  using  the  terms.  The  next  three  pictures 
despict  the  hierarchies  of  "Subjective  categories"  and  their  subcategories. 
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Figure  1:  Evaluation  hierarchy  (SC  =  Subjective  Category) 

/ 


Empirically  determined  hierarchy  of  subjective  categories 
explicating  the  concept  at  creditworthiness 


Figure  2:  Empirically  determined  hierarchy  of  subjective  categories  explicating 
the  concept  of  credi tworthiness 
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Figure  3:  Measure  of  Effectiveness  (here  readiness  to 


III.  Vague  relationship 

Statements  relating  phenomina  to  each  other  can  also  be  vague.  Examples  of 
this  type  of  statements  are  “not  much  larger  than",  "approximately  equal 
to..."  etc. 

Intrinsic  and  informational  vaguenesses  are  best  characterized  by  the  following 
citations: 


In  1923  B.  Russell  noted  already: 

"All  traditional  logic  habitually  assumes  that  precise  symbols  are  being  em¬ 
ployed.  It  is  therefore  not  applicable  to  this  terrestrial  life  but  only 
to  an  imagined  celestial  existence." 

Principle  of  Incompatibility:  (Zadeh  1965):  "As  the  complexity  of  a  system 
increases  our  ability  to  make  precise  and  yet  significant  statements  about 
its  behaviour  diminishes  until  a  threshold  is  reached  beyond  which  preci¬ 
sion  and  significance  (or  relevance)  become  almost  mutually  exclusive 
characteristics." 

2.  What  are  Fuzzy  Sets 


The  notion  of  a  fuzzy  set  and  the  axiomatic  system  of  fuzzy  set  theory  was  put 
forward  by  L.  Zadeh  in  1965.  Here  just  some  of  his  basic  definitions: 


Fuzzy  Set: 

If  X  =  [x}  is  a  collection  of  objects  denoted  generically  by  x  then  a  Fuzzy 
Set  A  in  X  is  a  set  of  ordered  pairs 

A  =  { (x ,  nA(x) )  |  x  e  X}  . 
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M^(x)  is  called  a  membership  function  or  grade  of  membership  of  x  in 
A  which  maps  X  to  the  membership  space  M.  The  range  of  the  membership  func¬ 
tion  is  a  subset  of  the  nonnegative  real  numbers  whose  supremum  is  finite. 

Equality :  Two  fuzzy  sets  A  and  B  are  equal  iff 

MA(x)  =  Mb(x)  for  all  xeX 

Intersection:  The  membership  function  of  A  n  B  is  given  by 

(’‘arid") 

UAn|j(x)  =  Min  ( M A ( X ) .  1JB(X)) 

Union:  The  membership  function  of  A  u  B  is  defined  as 

("or") 

MAuB(x)  =  Max  ^A(x)> 

Example:  Let  X  =  (10,  20,  30,  40,  50,  60,  70,  80,  90,  100,  1101 

be  possible  speeds  (mph)  at  which  cars  can  cruise  over  long 
distances.  The  fuzzy  set  A  of  “comfortable  speeds  for  long 
distances11  may  be  defined  by  certain  individual  as 

A  =  {(30,  0.7),  (40,  0.75),  (50,  0.8),  (60,  0.8),  (70,  1.0) 

(80,  0.8),  (90,  0,3)} 


Support:  The  support  of  a  fuzzy  set  A  is  a  set  S( A)  such  that  x  e  X( A)  iff 
M-j (x)  >  0. 

Normality:  It  has  already  been  mentioned  that  the  membership  function  is  not 
limited  to  values  between  0  and  1.  If  SupxuA(x)  =  1  the  fuzzy  set 
A  is  called  normal.  A  non-empty  fuzzy  set  A  can  always  be  normalized 
by  dividing  uA(x)  by  Supxu1(x). 
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Algebraic  Product:  The  membership  functioning  of  the  algebraic  product  of 

two  fuzzy  sets  A  and  B  is  defined  as: 

WAB  =(mA  '  UB)* 

Algebraic  Sum:  The  membership  function  of  the  algebraic  sum  of  A  and  B  is 
also  defined  by  its  membership  function: 

^AQB  =  +  UB  -  ^Vb 

Relation :  A  fuzzy  relation,  R,  in  the  product  space  X  x  Y  =  l (x,y)}  |  x  e  X, 
y  e  Y  is  a  fuzzy  set  in  X  x  Y,  whose  membership  function  asso- 
ziates  with  each  ordered  pair  (x,y)  a  grade  of  membership  pD(x,y) 
in  R.  An  n-ary  Relation  in  a  product  space  X  =  X1  x  X  x  ...  x  Xn 
is  then  characterized  by  a  corresponding  n-variate  membership  func¬ 
tion.  7 

The  notion  of  a  "decision11  has  always  had  very  many  different  semantic  inter¬ 
pretations.  Two  distinct  approaches  are  of  particular  importance:  In  cognitive 
(descriptive)  decision  theory  a  decision  is  an  information  processing  process 
which  can  either  lead  to  an  evaluation  (measure  of  effectivness)  to  a  ranking 
of  different  alternatives  or  to  an  "optimal  solution'*.  Probably  better  known 
is  the  definition  of  a  decision  which  is  used  in  normative  decision  theory 
(logic  of  decisions).  Here  a  decision  is  the  act  of  selecting  a  specific  solu¬ 
tion  (action)  which  is  feasable  (element  of  the  solution  space)  and  optimal 
(f.i.  maximising  an  objective  function). 

The  latter  notion  lead  Bellman  and  Zadeh  in  1970  to  define  a  decision  in  a 
fuzzy  environment  as  follows: 

in  a  fuzzy  decision  situation  the  constraints  as  well  as  the  objective  func¬ 
tion^)  can  be  fuzzy  sets,  characterized  by  their  membershipfunctions  and  the 
"decision"  is  the  (fuzzy)  set  of  all  activities  which  are  members  of  the  fuzzy 
constraints  sets  and  the  fuzzy  sets  characterizing  the  objective  function(s) 
i.e.  the  "decision"  is  the  intersection  of  all  fuzzy  sets  involved  (objective 
functions  and  constraints). 
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Example  1 : 


The  board  of  directors  is  trying  to  find  the  "optimal"  dividend  to  be  payed 
to  the  shareholders.  For  financial  reasons  it  ought  to  be  attractive  and  for 
reasons  of  wage  negotiations  it  should  be  modest.  The  fuzzy  set  of  the  objec¬ 
tive  function  "attractive  dividends"  could  for  instance  be  defined  by: 

r  1  for  x  >  5.8 

|in(x)  =  i  — L - .  ( -464x3  +  7961  x2  -  14530x  +  7033)  for  1  <  x  <  5.8 

u  1  100.000  -  - 

I 

0  for  x  <  1 


The  fuzzy  set  "decision"  is  then  pQ  =  Min  (Mq(x),  Pq(x)) 


|ilxl 


Figure  4:  A  fuzzy  decision 
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So  far  the  "decision"  is  still  a  fuzzy  set.  A  reasonable  way  of  picking 
a  dividend  would  now  be  to  select  the  solution  which  lias  the  highest  degree 
of  membership  in  the  decision  set.  In  our  example  this  "optimal  decision" 
would  be  the  dividend  =  3.5%  with  a  degree  of  membership  in  the  decision 
set  of  pd(x°)  =  .338. 

A  second  example  which  could  also  be  regarded  as  a  decision  is  the  following: 
Example  2: 


An  instructor  at  a  university  has  to  decide  how  to  grade  written  test  papers. 

Let  us  assume  that  the  problem  to  be  solved  in  the  test  was  a  linear  programming 
problem  and  that  the  student  was  free  to  solve  it  either  graphically  or  using 
the  simplex  method.  The  student  has  done  both.  The  student* s  performance  is 
expressed  -  for  thegraphical  solution  as  well  as  for  the  algebraic  solution  - 
as  the  achieved  degree  of  membership  in  the  fuzzy  sets  'good  graphical  so¬ 
lution'  (G)  and  'good  simplex  solution'  (S),  respectively.  Let  us  assume  that 
he  reaches 

Mq  =  0.9  and  \i^  =  0.7. 

If  the  grade  to  be  awarded  by  the  instructor  corresponds  to  the  degree  of 
member sh i p  of  the  fuzzy  net  'good  solutions  of  linear  programming  problems' 
it  would  be  quite  conceivable  that  this  grade  p^p  could  be  determined  by 

PLp  =  Max(  pG,  ps)  =  Max  (0.9,  0.7)  =  0,9. 
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3.  Fuzzy  Optimization 


As  a  special  structure  of  an  optimization  problem  we  shall  choose  Linear  Pro¬ 
gramming.  Let  us  consider  the  following  cases: 

A  company  wanted  to  decide  on  the  size  and  structure  of  its  truck  fleet.  Four 
differently  sized  trucks  (x1  through  x4)  were  considered.  The  objective  was  to 
minimize  cost  and  the  constraints  were  to  supply  all  customers  (who  have  a 

i 

strong  seasonally  fluctuating  demand).  That  meant:  certain  quantities  had  to 
be  moved  (quantity  constraint)  and  a  minmum  number  of  customers  per  day  had 
to  be  contacted  (routing  constraint).  For  other  reasons,  it  was  required  that 
at  least  6  of  the  smallest  trucks  be  included  in  the  fleet.  The  management  wan¬ 
ted  to  use  quantitative  analysis  and  agreed  to  the  following  suggested  linear 
p  r  o  g  ra  trim  ing-upp  roach: 

Minimize 

4 1 . 400x  ^  +  44.300x2  +  +  49,10Qx4 

subject  to  constraints 

0.84x1  +  1.44x2  +  2.16X3  +  2.40x4  >  170 

1 6x 1  +  16x2  +  I6X3  +  16x4  >  1,300 

Xj  >  6. 

The  solution  was  x^  =  6,  x2  =  17.85,  X3  =  0,  x4  =  58,64.  Min  Cost  =  3.670.850. 
Since  the  management  felt  that  is  was  forced  into  giving  precise  constraints 
(because  of  the  model)  in  spite  of  the  fact  that  it  would  rather  have  given 
some  intervals,  the  following  "fuzzy"  approach  was  used: 

Starting  from  the  problem: 

Minimize  Z  =  cx 

subject  to  constraints  Ax  <  b 

x  >  0, 

the  adopted  "fuzzy"  version  was 

cx  <  Z 
Ax  <  b 

it 

x  >  0. 
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Here  c  is  the  vector  of  coefficients  of  the  objective  function,  b  is  the 
vector  of  constraints  (in  our  case  the  amount  to  be  shipped,  the  number  of 
customers  to  be  contacted  and  the  minimum  number  of  small  trucks  required), 
arid  A  is  the  coefficient  matrix.  The  symbol  "<"  denotes  the  fuzzied  version 
of  and  reads  "essentially  smaller  than  or  equal  to". 

We  now  define  a  function  f:  Rrn+1  -[0,1]  such  that 

10  if  Ax  5  b  and  cx  <  Z  is  strongly  violated 
1  if  Ax  <  b  and  cx  <  Z  is  satisfied. 

Using  the  simplest  version  of  the  function  (Ax,cx)  we  assume  it  to  be  linear 
and  the  intersection  of  the  (fuzzy)  constraints  and  the  (fuzzy)  objective 
function. 


Thus 

f ( Ax ,cx)  =  f(Bx)  =  Min  f - ( Bx) A ,  x  >  0 

with 


f i (Bx).  )  . 


for  (Bx)j  <  bj 

for  b-  <  Bxj  <  b-  +  d. 

for  (Bx).  >  bi  +  di 


where  d-  are  subjectively  chosen  constants  of  admissible  violations  of  the 
constraints, 

f-(Bx).  is  the  membership  function  of  the  i-th  row  of  the  linear  system  Bx.f 


Min  f . (Bx) ^ ) 

is  the  "fuzzy"  decision 

and 

Max  Min  fj(Bx)j) 

the  decision  with  the  highest  degree  of  membership. 
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Substituting. 


componentwise  and  simplifying  it  by  dropping  the  "1"  (which  does  not  change  the 
problem!)  Yz  arrive  at  the  following  problem: 

Ma>  Kin  (b|  -  (B'x).)  (x) 

x>G  i  1 2 3 4 5 

or 

Max  gn(x) 
x>0  u 

As  is  well  r-own,  this  problem  is  equivalent  to  solving  the  following  linear 
program: 

Maximize  x 

subject  to  constraints  x<b.  -  (tlx).,  i  = 

x  >  0 

The  optimal  solution  to  (x)  is  also  the  optimal  solution  to  (xx). 

When  this  approach  was  applied  to  our  problem  the  following  assumptions  were 
made: 

1)  Total  cost  should  not  rise  above  4.200.000  (budget  limit). 

2)  The  "unfuzzy"  constraints  are  minimum  requirements  and  management  would  feel 
much  better  if  there  was  some  "leeway". 

3)  The  linear  approximation  of  the  membership  functions  are  acceptable. 

4)  There  are  no  interdepencies  between  the  constraints. 

5)  Weighting  of  the  constraints  is  taken  care  of  by  defining  the  constants  d^. 
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The  results  are  shown  in  the  following  table: 


Unfuzzy 

Fuzzy 

• 

0  =  0 

U  =  1 

Objective  Function 

4.200.000 

3.700.000 

1st  constraint  170 

170 

180 

2nd  constraint  1 .300 

1.300 

1.400 

3rd  constraint  6 

6 

12 

Our  non-fuzzy  equivalent  problem  in  the  form  of  (xx] 

Maxi  in  i/o  a 

subject  to  constraints 

\  <  7.4  -  O.G83x^  - 

0.089x2 

-  0.096x3  - 

0.098x4 

x  -  Ifi  f  0.084x^ 

+  0.144xo 

C  \ 

+  0.216x3  +  0 

,24x4 

X  <  -  14  +  Q.16x1  + 

O.I6X2 

+  0.16x3  +  0. 

16x4 

X  <  -  2  +  0.167 

X1 »x2,x3,x4  —  0- 

Solution: 

Unfuzzy 

Fuzzy 

xt  -  6 

X1  =  17.41 

x2  =  17.85 

x4  =  66.54 

x^  -  58.65 

Z  =  3.918.850 

Z  =  3.988 

.257 

Constraints : 

1.  171.5 

174.2 

2.  1.320 

1  .342.4 

3.  6 

17.4 
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As  can  be  seen  from  the  solution,  "leeway"  has  been  provided  with  respect  to 
all  constraints  and  at  additional  cost  of  1.7  percent. 

The  main  advantage,  compared  to  the  unfuzzy  problem  formulation,  is  the  fact 
that  the  decision  maker  is  not  forced  into  a  precise  formulation  because  of 
mathematical  reasons  even  though  he  might  only  be  able  or  willing  to  describe 
his  problem  in  fuzzy  terms.  Linear  membership  functions  are  obviously  only  a 
very  rough  approximation.  Membership  functions  which  monotonical ly  increase 
or  decrease,  respectively,  in  the  interval  of  [b.,b^+d^l  can  also  be  handled 
quite  easily. 

The  above  fuzzy  LP  approach  can  also  be  used  to  tackle  multi-cri teria-problerns 
very  efficiently. 

In  the  following  we  shall  restrict  our  considerations  to  linear  programming 
problems  with  vectorvalued  objective  functions.  For  instance  we  shall  use  the 
following  example: 


Example  3 


s.th.  -x1  +  3x2  <  21 
x1  +  3x2  <  27 
4x1  +  3x2  £  45 
3x  ^  +  x2  <  30 
xr  x2  >  0. 
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Figure  5:  A  Vector  Maximum  Problem 


Figure  5  shows  the  solution  space  of  this  problem.  The  "complete  solution" 
is  the  edge  x1  -  x2  -  x3  -  x4.  x1  is  optimal  with  respect  to  objective  Func¬ 
tion  z1  =  -x1  +  2x2.  x  is  optimal  with  respect  to  objective  function 
z2  =  2x1  +  x2.  The  "optimal"  values  are  (x1)  =  14  and  (x4)  =  21, 
respectively .  For  x1  =  (7,0)  zg  =  7  and  x4  =  (9,3)  yields  z,  =  -3. 

5 

Solution  x  =  (3.4,  0.2)  is  the  solution  which  yields  ~  -3,  z2  “  7  the 
lowest  "justifyable"  values  of  the  objective  functions  in  the  sense  that  a 
further  decrease  of  the  value  of  one  objective  functions  could  not  be  "balanced" 
or  even  counteracted  by  an  increase  of  the  value  of  the  other  objective  function. 

We  shall  now  apply  our  FLP  approach  to  the  vectormaximum  problem  and  make  the 
fol lowi ng  assumptions : 

The  membership  functions  p^  (x)  and  p2  (x)  of  the  fuzzy  sets  characteri zing 
the  objective  functions  rise  linearly  from  0  to  1  at  the  highest  achievable 
values  of  z1  =  14  and  z2  =  21 ,  respectively.  Thus 
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0 


^  (X)  = 


for  <  -3 

i  z^x)  +  3 

l  — -  for  -3  <  z1  <  14 

17  1 

J  for  14  <  z1 


u2(x)  = 


'0  for  Z2  <  7 

z2(x)  -  7 

<  — -  for  7  <  z,  <  21 

14  ‘ 

1  for  21  <  z9 


We  arrive  at  the  following  problem 
Max  x 

s.th.  X  <  -0.05882X,  +  0.117  x2  +  0.1764 
X  <  +0.1429  x1  +  0.0714x2  +  0.5 


21  > 

-xi * 

3x, 

,  i 

27  > 

xi + 

3x 

i 

45  > 

4x^  + 

3x; 

30  > 

3x1  + 

x 
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Nonlinear  membership  functions 


Hersh  and  Caramazzy  have  shown  that  membership  functions  of  the  following 
type  exist:  (also  compatible  with  economic  laws  of  decreasing  rate  of  in¬ 
crease  of  utility): 


fly) 


i  (c)‘  1 1 


f  (b) 


f (a)“0 


a  b  c 


Figure  7:  Nonlinear  Membership  Function 

Such  a  nonlinear  membership  function  can  be  described  by  a  hyperbolic 
function.  For  each  objective  function  z.,  j  =  the  corresponding 

u  J 

hyperbolic  membership  function  pn  is  defined  as  follows. 


H  .  N  It* 
:=  2 

D  . 


CD  O 

<,.(>:)-  2- 


m  o 

)  a  -It.  .  <*)-  -  )<, 

3  -  i,  '■>  J  i 


Uj  O 

U  .  (>:)  •  n  -)u 
3  2  3 


+  G  3 


Hi  O 

M-  \  -  )ci  . 
2  3 


4 


1 


where  ^  is  a  parameter. 


The  problem  now  reads: 


Max  x 
such  that 


mo  ib  o 

z  .  +z  .  z  ,  +  z  , 

(z  (x) — 2_1)  q  -(z.(x) — 3 )  a  . 

!  3  2  3  -  e  3  2  3 


mo  LB  o 

z  .  +  Z  .  Z  .  +  Z  . 

( Z  .  ( X )  — 1)  n  .  -  ( z  .  ( X )  — ^-—4  a 

f  3  2  3  f-  e  3  23 


1  ,  -  -  .  ,k 


Ax  £  b 
x  £  O 
X  ^  O. 


This  is  a  nonlinear  programming  problem  with  one  linear  objective  function, 
k  nonlinear  and  m+n+1  linear  restrictions.  We  shall  now  show  that  there 
exists  an  equivalent  linear  optimization  problem. 

It  can  be  shown  (Leberling  1980)  that  for 

xr|+1  -  Lmli  1  (  2a-  1 ) 

an  equivalent  formulation  is 

Max  xn+1 

such  that  ojZj(x)  -  xn+,  >  JcjjtzJ  +  z°),  j  =  1 . k. 

Ax  <  b 
x  >  0. 

For  our  problem:  {For  ^  =  3/  ^  ( z j  -  z j ) )  we  obtain  the  following  problem 
formulation: 
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Max  x 
such  that 


x  - 


-5,5)1T  _  ^(-x^zxa-s.s)^ 
2  ^(-x^Zx^S.S)^  +  -|-)f1HV5'5)167 


X  -  Jr 


1  e(2x1+x2-14)A  _  -IZx^Xj-H)^ 


+  e 


1 

2 


-1«> 

6 

14 

-4 

i 

2 

■*1 

4  3*2 

£ 

21 

*1 

+  3>:2 

<■ 

27 

4x1 

4  3x2 

£ 

45 

3x3 

£ 

30 

X1 

£ 

0 

>:2 

p 

0 

X 

? 

0 

The  equivalent  problem  formulation  is 


Max 

such  that 


-6x1 

4 

1 2x  2  - 

17x3 

> 

33 

1 2x  ^ 

+ 

6x2  - 

14x3 

> 

84 

-  xi 

+ 

3x2 

< 

21 

xi 

4 

3x2 

< 

27 

4xi 

4 

3  x  2 

45 

3xi 

4 

x2 

< 

30 

xi 

> 

0 

x2 

> 

0 
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The  optimal  solution  is 

(X1  Pt;  x2°Pt;  x3°Pt)  =  (5.03;  7.32;  1.45)T. 

The  maximum  degree  of  membership  of  satisfaction 
x°Pt  =  j  tanh  (x3°pt)  +  J  =  0.95 

is  achieved  at  the  solution  (x^;  x^)7  =  (5.03;  7.32)7. 

This  solution  os  the  wanted  compromise  solution  of  the  given  LVOP 


mMk 


=~4 


5<vvyAA 


Figure  ft;  The  Nonlinear  Case 


4.  Fuzzy  Evaluation 


Examples  of  evaluation  problems  are  the  determination  of  creditworthiness 
and  “readiness  to  fight",  which  were  mentioned  at  the  beginning.  Evaluation 
and  optimization  problems  certainly  have  one  feature  in  common:  Fuzzy  sets 
representing  subjective  categories,  objectives  or  constraints  have  to  be 
aggregated. 

Comparing  the  dividend  example  with  the  grad i ng_  problem  it  seems  appropri¬ 
ate  to  reconsider  the  original  definition  of  a  decision  as  the  “intersection 
of  ail  fuzzy  sets  involved".  The  grading  example  would  suggest  rather  to  use 
the  union  as  a  model  and  the  two  evaluation  models  do  not  give  us  any  hint 
in  which  way  the  aggregation  ought  to  be  accomplished. 

The  two  definitions  of  decisions  -  as  the  intersection  or  the  union  of  fuzzy 
sets  -  imply  essentially  the  following: 

The  interpretations  of  a  decision  as  the  intersection  of  fuzzy  sets  implies 
no  positive  compensation  (trade-off)  between  the  degrees  of  membership  of 
the  fuzzy  sets  in  question,  if  either  the  minimum  or  the  product  is  used  as 
an  operator.  Each  of  them  yields  degrees  of  membership  of  the  resulting 
fuzzy  set  (decision)  which  are  on  or  below  the  lowest  degree  of  membership 
of  all  intersecting  fuzzy  sets  (see  Example  1). 

The  interpretation  of  a  decision  as  the  union  of  fuzzy  sets,  using  the  max- 
operator,  leads  to  the  maximum  degree  of  membership  achieved  by  any  of  the 
fuzzy  sets  representing  objectives  or  constraints.  This  amounts  to  a  full 
compensation  of  lower  degrees  of  membership  by  the  maximum  degree  of  mem¬ 
bership  (see  Example  2). 

Observing  managerial  or  military  decisions  one  finds  that  there  are  hardly 
any  decisions  with  no  compensation  (trade-offs)  between  either  different 
degrees  of  goal  achievement  or  the  degree  to  which  restrictions  are  limi¬ 
ting  the  scope  of  decisions.  The  compensation,  however,  rarely  ever  seems 
to  be  "complete"  such  as  would  be  assumed  using  the  max-operator. 
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It  may  be  argued  that  compensatory  tendencies  in  human  aggregation  are 
responsible  for  the  failure  of  some  classical  operators  (min,  product, 
max)  in  empirical  investigations. 

It  is  crucial (for  the  appropriate  modelling  of  decisions  of  whatever  form 
to  know: 

1.  How  to  model  properly  subjective  categories  by  fuzzy  sets  (i.e.  how 
to  determine  membership  functions). 

2.  How  to  aggregate  fuzzy  sets  appropriately  in  order  to  arrive  at  a  de¬ 
cision,  a  judgment,  a  ranking  or  an  evaluation. 

Two  ways  of  proceeding  are  conceivable: 

a)  The  axiomatic  approach  (as  in  utility  theory),  establishing  an  axiomatic 
system  which  is  at  least  plausible  and  derive  the  resulting  mathematical 
models  for  aggregators  and  membership  functions. 

b)  Empirical  investigations  into  the  ways  people  think  of  subjective  cate¬ 
gories  (i.e.  determining  appropriate  membership  functions)  and  aggregate 
themto  arrive  at  conclusions  (i.e.  determining  models*  for  aggregates). 

Some  authors  have  used  the  first  approach  [  2  ]  .  We  shall  report  on  some 
results  of  doing  research  of  the  latter  type. 


5.  Empirical  Results 


5.1  Aggregators  (Connectives) 

In  earlier  studies  (6,9]  it  was  shown,  that  neither  minimum  nor  product 
operator  model  properly  the  human  "logical  and".  During  these  investiga¬ 
tions  it  appeared,  however,  that  in  managerial  decisions  the  "logical 
and"  (without  trade-offs")  is  not  used  at  all. 
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Our  hypothesis  was  that  human  beings  use  many  non-verbal  connectives  in 
their  thinking  and  reasoning.  One  type  of  these  connectives  may  be 
called  "merging  connectives"  which  may  be  represented  by  the  "compensa- 
Lory  and".  Being  forced  to  verbalize  them  men  possibly  map  the  set  of 
"merging  connectives"  into  the  set  of  the  corresponding  language  connec¬ 
tives  ("and",  "or").  Hence,  when  talking,  they  use  the  verbal  connec¬ 
tives  which  they  feel  closest  to  their  "real"  non-verbal  connective. 

Thus  a  new  word  "compensatory  and"  had  to  be  coined  and  possible  mathe¬ 
matical  models  for  it  tested.  A  number  of  models,  such  as  the  minimum, 
the  maximum,  the  geometric  mean,  the  arithmetric  mean  etc.,  were  tested. 
In  addition  a  special  model,  the  so-called  y -operator  was  tested. 

The  following  figures  indicate  some  of  the  results  : 
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Figure  10:  Min-operator:  Observed  versus  computed  grades  of  membership 
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Figure  12:  Arithmetic  mean:  Observed  versus  computed  grades  of  membership 
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Figure  13: Y -opera tor:  Observed  versus  computed  grades  of  membership 
The  ^-operator  performed  best.  The  idea  behind  it  is  the  following: 
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Jf  several  operators  are  necessary  iri  order  to  describe  a  variety,  of  phenomena, 
the  question  arises,  how  many  operators  are  needed,  as  each  important  situation 
in  practise  would  then  call  for  an  adequate  model.  Moreover,  one  would  be 
forced  to  assume  that  man  has  a  decision  rule  enabling  him  to  choose  the  right 
connective  for  each  situation.  The  pursuit  of  this  train  of  thought  and  especi¬ 
ally  its  application  implies  a  lot  of  difficulties.  We  feel  that  one  way  to 
bypass  these  difficulties  is  to  generalize  the  classical  concept  of  connectives 
by  introducing  a  parameter  7  which  may  be  interpreted  as  "grade  of  compensation". 

Each  point  of  the  continuum  between  "and"  and  "or"  represents  a  different  ope¬ 
rator.  One  way  to  formalize  this  idea  is  to  find  an  algebraic  representation 
for  a  weighted  combination  of  the  non-compensatory  "and"  and  the  fully  compensa¬ 
tory  "or".*  The  more  there  is  a  tendency  for  compensation  the  more  the  "or"  be¬ 
comes  effective  and  vice  versa.  As  "extremal"  operators  we  prefer  the  product 
and  the  algebraic  sum.  Of  course,  other  "extremes"  are  conceivable,  for  instan¬ 
ce  minimum  and  maximum.  But  in  our  opinion  these  models  are  handicapped  as  they 
do  not  reflect  the  interaction  of  membership  values.  Thus  we  define: 

/ 

1-7  7 

yA5B  '  ^AnB  '  mAuB  •  (x) 


The  membership  of  an  object  in  the  set  A®B  equals  the  product  of  the  weighted 
membership  values  for  the  intersection  and  the  union. 

If  the  intersection  and  the  union  are  algebraically  represented  by  the  product 
and  the  algebraic  sum,  respectively,  then  the  operator  becomes: 


f  m  N 

l'G  =QJUi 


1  - 


(1 


0  <  M  <  1  ,0  <  7  <  1 


=  1 


i  =  m  =  number  of  sets  to  be  connected. 


It  can  be  shown  that  the  7-operator  is  pointwise  injective,  continuous, 
monotonous,  commutative  and  in  accordance  with  the  truth  tables  of  dual  logic. 
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Using  this  operator  a  model  with  very  high  predictive  power  could,  for  instance, 
be  derived  for  the  credit  worthiness  decision.  The  y's  were  determined  from 
observations  and  then  the  following  hierarchy  resulted: 


Ci edit 

worthiness 


Figure  14:  Empirically  determined  values  of  y  for  the  creditworthiness  hierarchy 


5.2  Membership  Functions 


When  determining  membership  functions  empirically  one  first  has  to  decide  on 
which  scale  level  the  resulting  function  lias  to  be.  This  depends  on  the  inten¬ 
ded  use  of  membership  functions.  If  they  are  to  be  used,  for  instance,  in  the 
framwork  of  fuzzy  linear  programming  then  they  have  to  be  on  an  absolute  seal 
level.  It  is  well  known,  however,  that  the  human  being  is  no  reliable  measure¬ 
ment  device  on  this  level.  In  [6  ]  it  was  shown  how  to  generate  ’'quasi-cardinal" 
degrees  of  membership  from  human  answers.  These  degrees  of  membership  did  not 
constitute,  however,  "membership  functions 11  (membership  as  a  function  of  a  vari¬ 
able  which  is  to  be  optimized)  as  one  needs  them  for  mathematical  programming. 
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For  this  purpose  another  approach  was  used: 


Given  that  for  a  subjective  category  under  consideration  (for  instance  "young 
men")  there  exists  a  judgmental  scale  (years) ,  and  an  evaLuational  scale 
(degree  of  being  young)  then  the  membership  function  of  the  fuzzy  set  can  be 
Viewed  as  the  "distance"  of  these  two  scales.  If  the  distance  function  is 
called  d(x)  then  the  membership  function  (Type  II)  is 


M  ( X )  = 


1  +  d(x) 


Different  models  for  the  distance  function  are  possible.  The  one  that  performed 
best  so  far  is  -  not  surprisingly  -  the  logistic  function  d(x)  =  e~a^x+b'. 


The  following  pictures  indicate  some  of  the  results  of  empirically  testing 
the  membership  function 

u(x)  =  -  1 


1  +  e 


-a(x+bj 


The  lines  represent  the  mathematical  model  after  the  parameters  a  and  b  have 
been  determined  for  proper  calibration.  The  dots  represent  the  observations. 


►*u)  m(x) 


Figure  15:  Fuzzy  Set  "Young  Men" 
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ARTILLERY  CONTROL  ENVIRONMENT: 
AN  EXPERIMENTAL  TOOL 


Jill  H.  Smith  and  Jock  0.  Grynovicki 
Experimental  Design  and  Analysis  Branch/ACE  Team 
Systems  Engineering  and  Concepts  Analysis  Division 
Ballistic  Research  Laboratory 
Armament  Research  and  Development  Center/USAAMCCOM 
Aberdeen  Proving  Ground,  MD  21005 


ABSTRACT  ,  The  Army  is  fielding  a  new  digital 
communications  system,  the  TACFIRE  system,  shown  for  the 
brigade-area  in  Figure  1.  In  order  to  investigate  the 
command,  control,  and  communications  issues  associated  with 
the  new  devices,  the  Artillery  Control  Environment  (ACE)  was 
developed.  ACE  is  a  real-time,  multiplayer,  interactive 
simulation  system  run  on  a  commercial  computer  that 
interfaces  with  the  tactical  equipment  through  a  bit  box 
(microprocessor)*  This  paper  will  discuss  the  preparations, 
experimental  design,  data  collection  and  analysis  methods 
for  the  first  experiment  with  military  players  interfaced 
with  the  Artillery  Control  Environment  software  to  be 
conducted  8  May  -  10  June  83. 


I*  INTRODUCTION.  In  May  1982,  the  HELBAT  (Human 
Engineering  Laboratory  Battalion  Artillery  Test)  Executive 
Committee  agreed  that  the  Ballistic  Research  Laboratory 
Artillery  Control  Environment  (ACE)  and  HELBAT  activities 
should  be  combined  to  develop  a  Command  Post  Exercise 
Research  Facility  (CPXRF).  The  CPXRF  can  be  used  to 
demonstrate  the  use  of  commercial  automatic  data  processing 
(ADP)  technology  for  the  RDTAE  (research,  development, 
testing,  and  evaluation)  of  tactical  ADP  fire  support 
control  systems  and  also,  for  training  personnel  to  operate 
these  systems;  the  HEL/BRL  CPXRF,  however,  will  primarily  be 
used  for  research  and  exploratory  development  work. 
Further,  an  ACE/CPXRF  Subcommittee  was  formed  to  provide 
joint  DARC0M-TRAD0C  guidance  in  the  development  of  ACE 
technology  and  use  of  the  CPXRF,  The  ACE  software  is  a  key 
tool  in  the  CPXRF.  The  software  features  the  ability  to 
automatically  load  live  players  with  messages  produced  by 
target  acquisition  and  fire  direction  simulators  while 
recording  all  the  message  traffic  that  flows  between  the 
live  and  simulated  players. 

An  overview  of  the  CPX  Research  Facility  and  ACE 
program  is  given  in  the  1982  Sept-Oct  issue  of  the  Field 
Artillery  Journal  in  an  article  "HELBAT/ACE  Fire  Support 
Control  Research  Facility"  by  Mr.  Barry  Reichard.  The 
layout  of  the  facility  is  shown  in  Figure  2. 
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II ,  PURPOSE .  The  experiment  detailed  in  this  plan  is 
the  first  test  in  which  military  players  will  be  interfaced 
with  the  Artillery  Control  Environment  (ACE)  software.  The 
purpose  of  this  experiment  is  to  demonstrate  the  feasibility 
of  using  the  automated  techniques  of  the  CPX  Research 
Facility  for  fire  support  control  experiments. 

To  demonstrate  this  capability,  it  was  decided  that  a 
study  of  the  effects  of  message  intensity  and  degraded 
communication  on  a  Fire  Support  Team  Headquarter 1 s  (FIST  HQ) 
ability  to  perform  fire  support  coordination  would  be 
appropriate.  Message  intensity  is  defined  to  be  a  function 
of  message  type,  message  rate,  and  message  content. 


III.  TEST  CONCEPT. 


A,  Objectives 

1)  To  determine  the  effect  of  message  intensity  on  the 
FIST  HQ's  ability  to  perform  fire  support  coordination. 


2)  To  determine  the  effect  of  degraded  communication 
on  the  FIST  HQ's  ability  to  perform  fire  support 
coordination . 


3)  To  determine  if  message  intensity  and  degraded 
communication  have  a  combined  effect  on  fire  support 
coordination . 


B.  Measures  of  Performance 

A  measure  of  performance  (MOP)  is  a  response  that  is 
used  to  quantify  the  effects  of  the  factors  to  be  evaluated. 
Because  all  of  our  objectives  investigate  the  effect  on  fire 
support  coordination,  the  measures  of  performance  will  be 
the  same  for  all  three  objectives.  The  following  measures 
of  performance  will  be  computed  on  each  two  hour  cell  of  the 
test : 

1)  Number  of  messages  serviced  (i.e.  for  which  a 
response  has  been  gener ated ) /total  number  of  messages 
received  by  the  FIST  DMD  (digital  message  device) 

2)  Time  required  to  service  a  message  (i.e.  from  the 
time  the  message  is  received  by  FIST  DMD  to  the  time  the 
response  Is  first  transmitted) 

3)  Time  from  first  transmission  of  service  message 
until  acknowledgement  (ACK)  is  received  for  that  message 
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4)  Frequency  count  by 

acknowledged 

try  number  of 

messages 

5)  Frequency  count 
acknowledged 

by  try 

number  of  messages 

never 

6)  Number  of  fire 
missions  initiated 

missions 

completed /number 

of 

fire 

7)  Number  of  fire 
missions  expected  (i.e 

missions 
.  number 

completed/number 
of  fire  missions  in 

of 

the 

fire 

data 

base) 


C.  Scope 

The  fire  support  team  will  be  a  four-man  team  consisting 

of  : 


1)  the  fire  support  team  chief 

2)  the  fire  support  sergeant 

3)  two  radio  telephone  operator/dri vers . 

The  FIST  chief  will  be  available  to  the  FIST  HQ  for 
initial  supervision  only.  As  per  typical  operating 
procedures,  the  FIST  chief  may  be  absent  for  extended 
periods  of  time  (hypothetically  accompanying  the  company 
commander ) . 

The  FIST  HQ  will  be  task-loaded  by  software 
interactively  simulating  three  platoon-level  forward 
observers.  The  software  FOSCE  (Forward  Observer  SCEnario) 
will  use  tactical  scenarios  developed  by  Mr*  Arthur  Long  of 
the  US  Army  Field  Artillery  Board.  This  scenario  or  input 
database  is  detailed  in  the  Section  D,  "Input  Data  Base" 
under  RESOURCE  REQUIREMENTS. 

The  FIST  HQ  will  have  direct  access  to  fire  support  from 
a  company-level  mortar  platoon  fire  direction  center  (FDC) 
and  a  generic  field  artillery  fire  direction  center.  All 
FDC  operations  will  be  simulated  interactively  by  software. 
The  FIST  HQ  will  determine  the  proper  action  (based  on  the 
FIST  chief’s  guidance  and  training)  for  each  fire  request; 
either  to  deny  the  request,  service  the  request  with  mortars 
or  forward  the  request.  Fire  support  will  be  unlimited, 
that  is,  not  constrained  by  ammunition  resupply. 

All  members  of  the  FIST  Headquarters  will  be  trained  in 
the  operation  of  the  FIST  DM D  to  give  the  FIST  chief 
flexibility  in  managing  his  team. 
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D.  Limitations 


1)  The  FIST  DM D  will  be  operated  in  the  review  mode 

only. 


2)  After  receiving  a  fire  request,  the  fire  mission 
will  be  forwarded  in  the  automatic  mission  mode. 


3)  No  FIST  HQ  initiated  missions  will  be  included. 

4)  No  tactical  chores  will  be  performed,  e.g.,  guard 
duty,  close  station  march  order,  emplacement,  etc. 


5)  All  communication  will  be  digital,  no  voice 
communication  , 


E.  Test  Conf igur ation 

Figure  3  shows  the  nodes  that  will  be  played  in  the 
first  military  player  test.  The  FIST  HQ  equipped  with  the 
FIST  DMD  in  the  mock-up  vehicle  interacting  through  ETHER, 
the  intracomputer  communications  network,  with  three  forward 
observer  scenario  programs,  the  mortar  fire-direction 
simulator  and  battalion  fire-direction  simulator.  Figure  4 
shows  how  these  players  communicate  together  and  the  net 
assignments . 


IV.  RESOURCE  REQUIREMENTS. 


A.  Software 

ACE  software  permits  real-time  fire  support  command  and 
control  functions  to  be  exercised  in  a  controlled  laboratory 
environment.  The  software  is  written  in  the  C  programming 
language  and  is  designed  to  run  under  the  U.lbsd  (Berkeley) 
UNIX  operating  system.  The  major  components  of  the  ACE 
software  are  described  below. 

1.  ETHER 

ETHER  is  a  single  program  which  functions  as  an  intra¬ 
computer  communications  network.  Computer  ports  are  assigned 
to  communication  nets.  ETHER  accepts  a  message  from  a  port 
and  transmits  it  to  all  other  ports  on  the  assigned  net. 
Message  collisions  are  prevented  by  separately  buffering 
each  message  within  ETHEH. 

Each  net  is  assigned  a  probability  of  message  loss 
which  ranges  from  zero  to  one.  If  the  probability  of  message 
loss  is  zero,  the  net  is  an  ideal  net  and  all  messages  are 
sent  to  each  port  on  the  net.  If  the  probability  of  message 
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loss  is  greater  than  zero,  a  uniform  random  number  generator 
is  used  to  decide  whether  or  not  a  message  is  lost.  Lost 
messages  are  not  transmitted  to  any  port  on  the  net. 

ETHER  maintains  a  log  file  of  each  message  which  it 
receives.  In  addition  to  the  raw  message,  the  log  contains 
the  times  (Julian  day,  hour,  minute,  second)  for  the  start 
of  the  message,  the  end  of  the  preamble  and  the  end  of  the 
message . 


2.  Ace  Display  (ADIS) 

ADIS  utilizes  a  CRT  (cathode  ray  tube)  terminal  to 
display  in  real  time  the  messages  being  transmitted  through 
ETHER.  The  terminal  screen  is  divided  into  eight  columns 
which  are  labeled  for  the  players  (see  Figure  5).  Each 
message  is  displayed  as  two  lines  in  both  the  senders  and 
receivers  columns.  The  message  first  appears  in  the  senders 
column.  The  first  line  contains  the  message  type  and  target 
number  if  it  has  been  assigned.  The  second  character  in  the 
second  line  is  a  ,  indicating  "sender"  and  the  time  sent 
is  given.  The  message  will  then  appear  in  the  "receivers" 
column.  The  first  line  is  the  same  as  in  the  "sender’s"  , 
the  second  character  in  the  second  line  gives  the  address  of 
the  "sender"  and  the  time  received  is  displayed.  When  the 
acknowledgement  is  sent  by  the  "receiver"  an  "*"  is 
displayed  as  the  first  character  in  the  second  line  of  the 
"receiver"  and.  when  the  acknowledgement  is  received  by  the 
"sender"  an  "*"  is  displayed  as  the  first  character  in  the 
second  line  of  the  "sender".  If  the  message  is  degraded  by 
ETHER  "MSG  LOST"  appears  in  the  receivers  column.  Below  the 
columns,  the  last  message  sent  is  interpreted.  At  the 
bottom  of  the  screen,  the  time  from  the  start  of  that  run  is 
d ispl ayed  . 

3.  Forward  Observer  Scenario  (FOSCE) 

Forward  observer  scenario  program  reads  a  database  of 
forward  observer  (FO)  messages  and  transmits  the  messages  as 
if  they  were  being  generated  by  a  real  FO.  Each  message  is 
time  tagged  so  that  FOSCE  will  know  when  to  send  it.  In 
addition,  FOSCE  will  retransmit  a  message  if  it  does  not 
receive  an  acknowledgement  and  will  wait  for  message-to- 
observer  (MTO)  and  SHOT  messages  before  transmitting 
subsequent  adjust  (SA)  messages, 

4.  Fire  Direction  Simulator  (FDS) 

The  fire  direction  simulator  consists  of  four  programs 
which  perform  a  limited  number  of  TACFIRE/BCS  functions.  FDS 
accepts  fire  request  messages,  prioritizes  them,  assigns 
target  numbers  and  generates  MTO  and  SHOT  messages.  The 
number  of  simultaneous  missions  which  the  FDS  will  process 
may  be  specified.  If  the  number  of  missions  exceeds  the 
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maximum,  the  FDS  will  process  missions  based  on  mission 
priority* 

5.  Mortar  Fire  Direction  Simulator  ( MFDS ) 

The  mortar  FDS  simulates  communication  with  the  81  mm 
company  mortars.  It  is  a  special  version  of  the  FDS  program 
which  will  only  accept  one  fire  mission  at  a  time. 

6.  Bit  Box  Program  (BBP) 

The  bit  box  interface  program  accepts  messages  from 
ETHER  and  transmits  them  to  a  computer  port  which  is 
connected  to  a  bit  box.  The  program  also  reads  messages  from 
the  computer  port  and  transmits  them  to  ETHER. 


B.  Hardware 

1.  Two  Bit  Boxes 

Bit  boxes  are  microprocessor  based  devices  which  enable 
TACFIRE  hardware  to  communicate  with  commercial  computers. 
Bit  boxes  accept  TACFIRE  messages  from  wire  line  or  radio, 
perform  error  correction  and  convert  the  messages  to  RS232 
ASCII  characters  which  commercial  computers  can  accept.  They 
will  also  accept  a  message  from  the  computer,  add  the  error 
correction  bits,  time  disperse  the  message  and  transmit  it 
over  wire  line  or  radio  in  TACFIRE  format  (FSK). 

2.  FIST  DMD 

The  FIST  digital  message  device  that  will  be  used  in 
the  experiment  is  one  of  four  experimental  design  models 
(EDM)  that  are  in  existence.  It  is  a  prototype  model,  and 
not  a  production  model. 

3.  VAX  11/750  computer 

The  VAX  11/750  computer  will  be  dedicated  to  running 
the  experiment  and  will  have  no  other  processes  running 
during  the  test.  The  operating  system  is  the  4.1bsd 
(Berkley)  UNIX. 


C,  Training 

Test  participants  will  be  trained  in  the  operation  of 
the  FIST  DMD  by  an  instructor  from  the  Gunnery  Department  of 
the  US  Army  Field  Artillery  School  at  the  Human  Engineering 
Laboratory,  The  Human  Engineering  Laboratory  will  provide 
training  equipment  for  the  students.  The  test  participants 
are  trained  Fire  Support  Teams. 
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D.  Input  Data  Base 

The  tactical  scenario  data  base  contains  all  fire 
support  control  messages  for  a  limited  scenario  of  a 
mechanized  infantry  battalion  of  an  armored  division.  The 
SCORES,  Europe  III,  Sequence  2A  was  used  to  generate  targets 
expected  to  be  fired  by  a  battalion  in  sustained  combat 
operation.  The  battalion  is  constrained  by  ammunition 
resupply  under  normal  operations,  however,  it  was  decided 
that  ammunition  resupply  should  not  be  a  limiting  condition 
in  this  test.  The  entire  scenario  is  played  in  retrograde 
mode  , 


The  data  base  consists  of  36  two  hour  cells  of 
messages,  12  two  hour  cells  of  low  intensity,  12  two  hour 
cells  of  medium  intensity  and  12  two  hour  cells  of  high 
intensity.  Intensity  is  defined  by  the  number  of  initiating 
messages  per  two  hour  cell  in  Figure  6  and  the  message 
stream  that  follows  each  initiating  message  is  given  in 
Figure  7*  It  can  be  seen  that  intensity  is  a  function  of 
the  number  of  initiating  messages  and  their  subsequent 
messages.  The  36  two  hour  cells  of  data  are  arranged  such 
that  all  permutations  of  the  three  intensities  (L-M-H) 
appear  twice.  Ninety  percent  of  the  fire  missions  will  have 
normal  priority  and  the  other  ten  percent  urgent  priority. 


V,  DATA  COLLECTION. 


A.  Experimental  Design 
1.  Factors. 

The  two  factors  that  will  be  tested  in  this  experiment 
are  message  intensity  and  communications  reliability.  Three 
levels  of  message  intensity  will  be  tested  with  each  of 
three  levels  of  communication  reliability  giving  nine  test 
combinations.  Degradation  of  messages  is  total.  That  is, 
1 5 X  degradation  indicates  1 5 X  of  the  messages  are  lost  in 
their  entirety.  The  levels  of  each  factor  will  be  defined 
as  follows: 

Message  Intensity 
L  z  low 
M  =  medium 
H  =  high 

Communications  Reliability 
0  =  0%  degradation 
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1  =  15?  degradation 

2  =  30?  degradation 


2.  Assumptions, 

For  measures  of  performance  1,2, 3» 6  and  7  it  is  assumed 
that  these  measures  are  independent  and  normally  distributed 
random  variables  and  that  all  observations  of  a  given  MOP 
have  the  same  variance.  Only  the  assumption  of  independent 
random  variables  is  made  for  the  remaining  measures  of 
performance  . 

3,  Design  Matrix. 

Since  the  testing  of  all  nine  treatment  combinations 
require  a  minimum  of  18  hours  of  testing  and  realistically 
could  not  be  completed  in  one  day,  a  randomized  incomplete 
block  design  was  constructed  as  shown  in  Figure  8  so  that 
the  day-to-day  variability  would  not  influence  our  results. 
The  nine  treatment  combinations  were  divided  into  blocks  of 
three  and  the  three  blocks  were  run  over  a  three  day  period 
in  a  random  order.  The  experiment  will  be  repeated  for  each 
of  the  four  FISTs  so  that  an  unbiased  estimate  of  error  can 
be  obtained.  In  addition,  a  comparison  of  the  performance 
of  the  four  FIST  HQs  can  be  made  since  each  team  is  tested 
under  all  possible  treatment  combinations  twice. 

The  assignment  of  the  treatment  combinations  into 
blocks  was  based  on  a  confounding  scheme.  This  scheme 
assures  that  the  effects  of  message  intensity  (I)  and 
communication  degradation  (C)  and  the  interaction  of  these 
two  factors  (I  x  C)  on  a  FIST  HQ*s  ability  to  perform  fire- 
support  coordination  can  be  measured.  Because  time 
constraints  permit  only  two  replications,  part  of  the 
precision  of  the  estimate  of  the  interaction  was  sacrificed 
(i.e.  blocks  within  replicate  1  are  confounded  with  the 
linear  component  of  the  I  x  C  interaction  and  blocks  within 
replicate  2  are  confounded  with  the  quadratic  component  of 
the  I  x  C  interaction).  Randomization  of  treatment 
combinations  within  blocks  and  blocks  within  days  has  been 
performed,  hence,  the  test  will  be  run  in  the  order  shown  in 
the  design  matrix,  Figure  8. 

B.  Questionaires 

Quest ionaires  will  be  administered  at  the  end  of  the 
FIST  DM  D  training  and  at  the  end  of  the  test.  The 
questionaires  were  developed  by  Mr.  Leonard  Cunningham  and 
Major  Grim  of  the  Field  Artillery  Board  for  the  FIST  DM D 
Force  Development  Testing  and  Experimentation  (FDTE). 
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VI.  DATA  ANALYSIS 


A.  Statistical  Analysis 

The  data  analysis  will  be  based  on  a  incomplete  block 
design.  Analysis  of  variance  or  a  two-way  classification 
table  analysis,  frequently  called  a  contingency  table 
analysis,  will  be  the  methods  of  analyses  performed.  The 
null  hypotheses  to  be  tested  are: 

1)  There  is  not  a  statistically  significant  difference 
between  the  message  intensity  levels  as  measured  by  the 
stated  measures  of  performance. 

2)  There  is  not  a  statistically  significant  difference 
between  the  degradation  levels  as  measured  by  the  stated 
measures  of  performance* 

3)  There  is  not  a  statistically  significant  interaction 
effect  between  message  intensity  and  communication 
degradation. 

An  analysis  of  variance  (ANOVA)  will  be  used  to  test 
the  above  hypotheses  for  MOPs  1,2, 3*6  and  7.  However,  since 
this  analysis  is  sensitive  to  departures  from  the  assumption 
of  equal  variances,  a  check  will  be  made  to  assure  that  this 
assumption  is  valid.  An  appropriate  transformation  will  be 
performed  on  those  MOPs  that  depart  from  this  assumption  to 
assure  the  validity  of  the  analysis.  The  analysis  of 
variance  table  for  this  design  is  presented  in  Figure  9. 
The  error  term  will  be  obtained  by  pooling  the  FIST, 
replicate,  and  day  interactions  after  a  check  has  been  made 
to  assure  that  these  effects  are  not  significant. 

A  contingency  table  analysis  will  be  developed  for  MOPs 
4  and  5*  The  chi  square  statistic  will  be  used  to  test  the 
above  hypotheses. 

B.  Subjective  Analysis 

Questionaires  will  be  summarized  to  provide  feedback  on 
the  experiment,  the  CPX  Research  Facility,  training  and 
equipment . 
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BRIGADE-AREA  TACFIRE  SYSTEM 
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FIGURE  5.  Sample  ACE  Display  (ADIS) 


FACTORS 


1)  INTENSITY  (per  two  hour  block) 
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12 

6 

1 
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FIGURE  6 


749 


750 


DESIGN  MATRIX 

FIST 

|  REP! 

REP2 

TEAM 

DAYl 

DaY2 

DAY3 

DAY4 

DAYS 

DAY6 

L2 

MO 

LO 

LO 

HO 

TEAM 

Ml 

H2 

HI 

Ml 

LI 

ONE 

HO 

LI 

M2 

H2 

M2 

HI 

H2 

L 2 

Ml 

1 

HI 

TEAM 

L0 

MO 

HO 

LO 

MO 

TWO 

M2 

LI 

Ml 

H2 

HI 

L2 

M2 

H2 

Ml 

LO 

HI 

M2 

TEAM 

HI 

MO 

L2 

Ml 

MO 

HO 

THREE 

LJ0 

LI 

HO 

H2 

L2 

LI 

H2 

Ml 

M2 

HI 

LI 

LO 

TEAM 

MO 

L2 

HI 

MO 

HO 

Ml 

FOUR 

LI 

HO 

LO 

L2 

M2 

H2 

COMMUNICATION 
INTENSITY  DEGRADATION 


L*=  LOW 
M"=  MEDIUM 
H—  HIGH 


0=  03%  DEGRADATION 
1-  15%  DEGRADATION 
2“  30%  DEGRADATION 


FIGURE  8 


751 


ANALYSIS  C 

(AN 

IF  VARIANCE 

OVA) 

SOURCE 

DEGREES 

OF 

FREEDOM 

FIST 

3 

Rep 

1 

Block  within 
Rep 

4 

Intensity 

2 

Degradation 

2 

Intensity 

X 

Degradation 

4 

Pooled  error 

55 

Total 

71 

FIGURE  9 
752 


MOVING  FINITE  ELEMENT  RESEARCH  FOR 
SHOCK  HYDRODYNAMICS,  CONTINUUM  MECHANICS  AND  COMBUSTION 


Robert  J.  Gellnas 
Said  K.  Doss 
Neil  N.  Carlson 
Science  Applications,  Inc. 
1811  Santa  Rita  Road,  Suite  104 
Pleasanton,  California  94566 


ABSTRACT.  The  overall  objective  of  this  research  is  to  investigate  the 
nunerical” properties  and  structure  of  the  moving  finite  element  (MFE)  method 
in  order  to  reduce  it  to  practice  for  the  numerical  solution  of  important  PUE 
systems.  This  research  focusses  upon  mathematical  and  computational  proper¬ 
ties  of  transient  MFE  solutions  in  1-D  and  2-D  of  (i)  the  full  viscous,  com¬ 
pressible  Navier-Stokes  equations  for  shocks  and  possibly  for  combustion  pro¬ 
cesses  in  gases,  and  (ii)  the  continuum  equations  for  impacts  of  initially 
solid  bodies,  where  constitutive  models  include  elastic,  plastic,  and  visco 
plastic  effects.  In  this  work,  primary  attention  is  devoted  to  the  distinc¬ 
tion  and  exacting  resolution  of  actual  physical  dissipation  effects  (over 
highly  disparate  scales)  vis  a  vis  numerical  dissipation  effects  which  fre¬ 
quently  obscure  the  actual  physical  dissipation  processes  in  PDE  solutions  of 
fluid  dynamics  equations.  Test  cases  which  demonstrate  these  distinctions 
are  presented,  and  those  factors  which  are  major  determinates  of  grid  node 
optimality  in  the  MFE  method  and  in  certain  other  adaptive  solution  methods 
for  PDE's  are  discussed. 

INTRODUCTION.  The  moving  finite  element  (MFE)  method  is  a  promising  new 
approach  for  solving  nunerically  the  partial  differential  equations  (PDE's) 
of  hydrodynamics,  continuum  mechanics,  combustion,  and  other  transport  equa¬ 
tion  systems.  In  the  MFE  method,  grid  coordinates  themselves  are  dependent 
variables  which  are  calculated  continuously  at  each  time  step  in  order  to 
minimize  PDE  residuals.  This  feature  has  successfully  suppressed  numerical 
dissipation  to  very  low  levels  which  has  resulted  in  the  accurate  resolution 
in  1-D  shocks  of  physical  transport  and  dissipation  effects  which  are  con¬ 
tained  in  the  full  Navier-Stokes  equations.  It  has  been  shown  in  recent  work 
that:  (i)  the  MFE  method  does  extend  logically  and  practically  to  2-D  and 
(ii)  extensive  research  in  several  new  areas  must  be  carried  out  in  order  to 
realize  the  full  potential  of  the  MFE  method. 

This  paper  reviews  progress  in  1-D  and  2-D  MFE  research  and  indicates 
areas  where  applied  mathematics  research  will  continue  to  advance  progress  in 
MFE  developments. 

1.  Steady  State  Shock  Structure  Calculations.  Until  recently,  neither 
computer  hardware  nor  numerical  solution  methods  for  partial  differential 
equations  (PDE's)  could  real istically  be  expected  to  resolve  the  highly  dis¬ 
parate  physical  scales  of  physical  dissipation  processes  in  shocked  fluids. 
However,  the  MFE  method  has  recently  shown  promise  in  1-D  of  resolving  shock 
structures  according  to  the  actual  physical  dissipation  processes  and  thermal 
conduction  which  occur  in  nature.  The  resolution  of  such  highly  disparate 
scales  is  especially  important  in  transient  systems  where  shocks,  contact 
surfaces,  and  other  fluid  structures  may  interact.  An  essential  benchmark 
is  a  verification  that  the  MFE  method  does,  in  fact,  reproduce  those  shock 
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structures  for  which  analytic  solutions  are  known.  Once  such  example  is  the 
steady  state  solution  of  the  full  Navier-Stokes  equations  for  a  freely  propa¬ 
gating  shock  In  an  ideal  gas.  Recall  that  the  Navier-Stokes  equations  are 
derived  as  the  first  approximation  of  kinetic  theory  which  is  outlined  for 
1-D  systems  in  Equations  ( 1 ) - ( 9 )  below.  From  kinetic  theory,1 


i'i  <»«>■» 

(1) 

'ov2' -  Sp[0>  *p[1>  *-[2’ 

(2) 

(3) 

A  zero-th  approximation  of  the  kinetic  theory  for  gases  uses 
tutive  relations  for  an  inviscid,  non-conducting  fluid;  i.e,. 

the  consti 

(4) 

-O 

II 

o 

II 

o 

(5) 

These  relationships  yield  the  well-known  Euler  equations. 


A  first  approximation  of  the  kinetic  theory  for  gases  gives  the  Navier- 
Stokes  equations  according  to: 


/  1  2.4  av 

(7  -  1)(E  -  ?pv  )  +  3^ 


(6) 


q 


(7) 


and  a  second  approximation  of  kinetic  theory  gives  the  Burnett  equations. 


(8) 


,  etc. 


(9) 


The  analytic  solutions  of  the  steady  state  Navier-Stokes  equations  are 
readily  obtained  and  can  be  found  in  Reference  2.  Using  these  analytic 
results,  we  can  consider  comparisons  between  the  MFE  solution,  the  exact 
analytic  solution,  and  an  anomolous  numerical  diffusion  solution  for  the 
shock  test  problem  which  has  been  considered  previously  by  Sod.3  Using  the 
notation  m  =  pv,  the  initial  conditions  are: 

m(x,0)  =  0 
p(x,0)  =  1 
E(x,0)  =  2.5 

p(x,0)  =  p(x,Q)  =  linear 
E(x,0)  =  linear 
p(x,G)  =  0.125 
E(x,0)  =  0.25  , 


0  <  x  <  1.0 
0  <  x  <  0.500 

0,500  <_  x  £  0.501  (10) 

0.501  <  x  <  1.0 
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Relative  Fluid  Velocity  Versus  Position 


Figure  1.  MFE  solution  of  Navier-Stokes  equation  for  G.  Sod's  test  problem, 


and  Dirichlet  boundary  conditions  are  used  at  x  =  0  and  x  =  1.0. 

Figure  1  presents  comparative  results  for  the  shape  of  the  shock  front  in 
normalized  units  during  free,  steady  state  propagation  at  t  ■  0.15.  It  is 
seen  that  the  MFE  solution  of  the  Navier-Stokes  equations  (solid  dots),  which 
used  only  19  grid  nodes,  compare  very  well  with  the  analytic  solution  (solid 
line).  As  a  numerical  experiment,  one  can  ask:  How  sensitive  are  the  solu¬ 
tions  of  the  this  problem  to  some  anomolous  or  artificial  dissipation  pro¬ 
cess?  To  address  this  question,  we  replaced  the  correct  physical  viscous 
dissipation  terms  in  the  Navier-Stokes  equations  by  a  linear  dissipation 
operator  (essentially  a  linear  Q  model,  which  is  frequently  used  in  order  to 
stabilize  some  other  PDE  solution  methods,  with  the  hope  that  the  physical 
solutions  of  the  true  Navier-Stokes  equations  would  not  be  altered  too 
greatly  by  the  more  more  convenient  linear  dissipation  model).  This  numerical 
experiment  was  also  performed  with  the  linear  dissipation  term  appearing  in 
the  density  equation,  as  well  in  the  momentum  equation,  as  a  sensitivity  test 
of  the  possible  effects  of  uncontrolled  numerical  diffusion  which  is  intrin¬ 
sic  in  any  PDE  method  when  the  grid  mesh  locations  are  not  optimal.  It  was 
found  that  using  the  same  value  of  v  =  0.0001  in  the  uncontrolled,  or  anomo¬ 
lous,  dissipaton  model  as  was  used  in  the  true  Navier-Stokes  calculations, 
the  computed  shock  shape  and  width  from  the  anomolous  dissipation  model  were 
greatly  in  error.  We  then  asked:  Can  the  value  of  v  in  the  anomolous  dissi¬ 
pation  model  be  selected  somehow  arbitrarily  so  as  to  achieve  agreement 
between  the  Navier-Stokes  solutions  and  the  linear  dissipation  model  solu¬ 
tions?  It  was  found  that  the  width  of  the  shock  calculated  by  the  anomolous 
dissipation  model  could  be  made  to  approximate  the  width  of  the  shock  in  the 
Navier-Stokes  solutions  by  increasing  the  value  of  v  by  a  factor  of  five  in 
this  specific  case.  But  it  was  not  possible  to  reproduce  the  correct  Navier- 
Stokes  shock  shape,  as  can  be  seen  from  the  open  circles  in  Figure  1,  by  any 
of  our  attempted  adjustments  of  v  in  the  anomolous  dissipation  model. 

It  is,  of  course,  recognized  that,  in  many  practical  examples,  it  may 
not  be  necessary  to  know  the  correct  shock  structure.  We  therefore  extended 
this  initial  investigation  of  steady  state  shock  effects  in  order  to  deter¬ 
mine  if;  when,  and  how  shock  structure  effects  may  become  important  in  prac¬ 
tical  applications.  A  first  extension  continued  the  present  MFE  calculations 
of  this  test  example  through  many  shock  reflections.  These  results  have  been 
reported  In  Reference  4.  It  was  found  that  the  MFE  solutions  of  the  Navier- 
Stokes  equations  can  accurately  resolve  the  repeated  mutual  interactions  and 
reflections  of  shocks,  contact  surfaces  and  rarefactions  and  that  this  prob¬ 
lem  can  be  solved  all  the  way  to  Its  final  equilibrium  state.  It  is  clearly 
evident  in  these  transient  results  that  the  accurate  resolution  of  the  actual 
physical  dissipation  processes  over  highly  disparate  scales  in  the  Navier- 
Stokes  equations  is  essential  for  practical  applications  in  which  shocks  are 
not  simply  in  a  state  of  free  propagation.  That  is,  the  strength  of  inter¬ 
actions  between  fluid  structures  and  thus  the  time  required  to  reach  equili¬ 
brium  are  sensitive  to  both  the  magnitude  and  operational  descriptions  of 
these  physical  dissipation  processes.  For  example,  model  thermal  conductiv¬ 
ities  which  are  larger  than  their  appropriate  physical  magnitudes  would  have 
the  effect  of  broadening  contact  discontinuities  too  much,  too  soon;  and 
ensuing  interactions  between  shocks  and  such  broadened  contact  discontinui¬ 
ties  would  drive  the  system  to  equilibrium  too  soon.  Conversely,  model 
thermal  conductivities  which  are  too  small  maintain  contact  discontinuity 
gradients  at  values  which  are  larger  than  their  appropriate  physical  magni¬ 
tudes  until  late  times,  and  the  corresponding  evolution  toward  equilibrium 
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would  then  be  slower  than  the  appropriate  physical  evolution.  These  features 
can  be  seen  in  the  results  for  both  this  moderate  shock  example  and  for  the 
well-known  problem  of  anomalous  wall-heating  which  is  considered  next. 

2.  The  Anomolous  Wall-Heating  Problem.  Anomolously  high  temperatures 
frequently  occur  in  computations  of  shock  reflections  from  an  Infinitely 
reflecting  wall  in  slab  geometry  or  from  the  reflection  of  an  imploding  shock 
at  the  origin  in  cylindrical  or  spherical  geometries.  The  MFE  results  which 
follow  indicate  the  anomolous  aspects  are  eliminated  when  the  physical  dissi¬ 
pation  processes  in  the  Navier-Stokes  equations  are  accurately  resolved  in 
the  transient  reflection  process.  We  consider  the  following  test  problem  in 
1-D  slab  geometry: 


Initial  Conditions: 


p(x,  0)  =  1 

0.  < 

p(x,  0)  =  e(x,  0)  =  0 

0.  < 

v(x,  0)  =  -1 

Ax0  < 

v(x.  0)  =  linear 

0  ? 

O 

II 

O 

A 

X 

> 

x  * 

Y  =  5/3 

x  £  2. 
x  £  2.0 
x  £  2.0 
x  £  Ax0 
0. 


Boundary  Conditions: 

Reflection  at  x  -  0. 

Dirichlet  at  initial  values  at  x  =  2.0 


Rankine-Hugoniot  Solutions  for  Infinite  Shock  (t  fr) : 


s  =  1/3 
p+  =  4.0 
e+  =  0.5 
v+  *  0. 
p+  =  1.33 


1.0 

0. 

-1. 

0. 


The  time  evolution  of  this  shock  was  solved  by  the  MFE  method  in  two 
ways:  First  the  full  Navier-Stokes  equations  were  solved  with  alternative 
values  of  v  =  4u/3  and  k.  These  solutions  are  denoted  by  N-S  in  the  accom¬ 
panying  figures.  In  one  set  of  N-S  solutions,  v  =  k  =*  0.01,  which  is  unreal¬ 
istically  large  but  which  permits  comparisons  to  other  fixed  node  PDE  solu¬ 
tions  that  may  use  on  the  order  of  100  to  several  hundred  grid  nodes.  In 
another  set  of  N-S  solutions,  v  =  ic  *  0.001,  which  approximates  physically 
realistic  values  for  actual  dissipation  processes  in  gases.  Second,  the 
variable  e  denotes  internal  energy  per  unit  mass  in  the  accompanying  figures, 
and  anomolous  diffusion  (denoted  by  A  in  the  accompanying  figures)  was  simu¬ 
lated  by  including  a  diffusion  term,  vrpxx,  in  Equation  (1).  This 
effectively  simulates  some  form  of  uncontrolled  numerical  diffusion  which 
is  present  intrinsically  in  many  alternative  PDE  methods.  (Such  anomolous 
diffusion  can  find  its  way  into  alternative  PDE  methods  by  numerous  and 
various  means). 


At  t  =  0+,  the  shock  incident  on  the  origin  is  in  the  incipient  state 
of  outward  reflection.  At  t  =  0.05,  Figures  2  and  3  show  that  the  calcula¬ 
tions  of  the  reflected  shock  with  uncontrolled  diffusion  (or  simulated  numer¬ 
ical  diffusion)  tend  inmediately  to  overheat  in  e  and  to  correspondingly 
undershoot  in  p  relative  to  the  Navier-Stokes  solutions.  Although  these 
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transient  solutions  are  not  near  their  steady  state  values  at  this  early 
time,  it  will  be  seen  that  the  ensuing  evolution  toward  equilibrium  is  quite 
sensitive  to  both  the  magnitudes  and  the  nature  of  the  dissipation  processes 
in  the  computations. 

Figure  4  shows  that  at  t  -  0.15,  the  lip  of  the  shock  in  the  Navier- 
Stokes  solution  is  approaching  the  steady*  state  value  of  p  =  4.0,  and  the 
anomolous  dissipation  solution  lags  by  a  significant  margin.  The  fluid 
buildup  at  the  front  of  the  shock  is  evident  here  because  the  fluid  near  the 
origin  has  stagnated  while  additional  fluid  continues  to  stream  in  toward  the 
origin  from  the  region  to  the  right  of  the  shock.  Figure  5  shows  that  the 
anomolous  dissipation  results  continue  to  lag  behind  the  Navier-Stokes  solu¬ 
tions  to  a  significant  degree  at  t  =  0.300.  At  t  =  2.0,  the  Navier-Stokes 
solutions  have  approached  steady  state  Rankine-Hugoniot  conditions  (not  shown 
in  Figure  6),  and  the  anomolous  dissipation  solution  has  "still  not  reached 
the  Rankine-Hugoniot  values  in  the  vicinity  of  the  origin.  The  anomolous 

wall  heating  effects  due  to  uncontrolled  dissipation  in  the  density  equation 
have  thus  persisted  to  very  long  times  vis  a  vis  the  accurate  solutions  of 
the  Navier-Stokes  equations.  Non-physical  dissipation  operators  in  fluid 

calculations  can  be  shown  to  have  similar  effects. 

Figures  7  and  8  present  the  results  of  another  test  of  sensitivity  of 
the  Navier-Stokes  solutions  to  non-optimal  grid  locations.  In  this  test 
problem,  a  physically  realistic  value  of  v  =  0.0001  is  used  in  MFE  solutions 
of  the  Navier-Stokes  equations.  We  have,  however,  deliberately  constrained 
the  MFE  grid  nodes  in  this  test  case  so  that  they  do  not  migrate  to  their 

truly  optimal  locations,  as  in  the  results  considered  previously.  Figure  7 

shows  several  significant  features:  (i)  the  shock  gradients  associated  with 

0.0001  are  extremely  large;  the  accurate  resolution  of  these  gradients 
would  require  several  thousand  nodes  if  a  fixed  node  PDE  solution  method  were 
to  be  used,  (ii)  the  Rankine-Hugoniot  solutions  are  approached  much  more 
rapidly  for  the  physically  realistic  values  of  v  than  for  the  larger  values 
of  which  are  typically  used  either  tacitly  or  explicitly  in  many  other  PDE 
solution  methods,  and  (iii)  the  slight  constraint  on  node  movements  and  thus 
on  nodal  positions  do  not  show  up  immediately,  but  once  the  perturbation 
becomes  significant  (as  seen  in  Figure  8),  its  effects  can  grow  rapidly.  In 
summary,  these  results  demonstrate  that  reflected  shock  solutions  can  be  very 
sensitive  to  non-physical  dissipation  effects  and  to  slight  deviations  from 
optical  grid  node  positioning,  even  in  adaptive  gridding  methods.  All  of  the 
results  in  this  section  were  obtained  with  approximately  30  MFE  nodes.  As 
many  as  61  MFE  nodes  were  used  to  verify  that  the  MFE  solutions  were  in  fact 
converged  solutions.  In  the  absence  of  the  stringent  tests  of  convergence 
which  were  applied  here,  it  can  be  extremely  difficult  to  discern  physical 
oscillations  and  dissipation  effects  from  non-physical  and/or  purely  numeri¬ 
cal  oscillations  and  dissipation  effects. 

3.  Burger's  Equation  in  2-D.  The  PDE’s  for  this  skewed  Burger's  model 
problem  are  given  by: 

ut  =  -uux  -  vuy  +  v(uxx  +  Uyy )  (11) 

vt  =  -UVX  -  vvy  +  v(vxx  +  vyy)  (12) 

where  u  is  the  x-component  of  velocity  and  v  is  the  y-component,  and  v  is 
an  effective  diffusion  coefficient.  Shocks  are  generated  with  gradient 
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Figure  7.  MFF  solutions  of  the  Navier-Stokes  equations  with 
slightly  non-optimal  node  positions. 


Figure  8.  MFE  solutions  of  the  Navier-Stokes  equations  with 
slightly  non-optimal  node  positions. 


magnitudes  on  the  order  of  1/v.  Initial  conditions  which  produce  a  doubly 
skewed  wavefront  profile  are  shown  schematically  in  Figures  9  and  10.  (The 
counter posed  initial  velocity  fields  are  designed  to  create  an  evolving  shock 
profile  which  is  skewed  in  both  the  x-  and  y-components  of  velocity.)  Bound¬ 
ary  conditions  are  given  by: 


u(0,y) 

vx(o.  y) 

u(x, 
u(x, 
v(x, 
v(x, 


1) 

0) 

1) 

0) 


u(l,y)  =  0 

vxd.  y)  =  0. 

0.2  sin  ttx 
-.2  sin  ttx 
-1 .  +  0.2  cos  ttx 
1 .  +  0.2  cos  ttx 


0.  <  y  <  1. 

0.  <  y  <_  1. 

o.3x<i. 

0.  <  x  <  1. 

0.  1  x  1  1  • 

0.  <  x  <  1  . 


The  MFE  nodes  are  fixed  by  zero-Dirichlet  conditions  along  the  top  and 
bottom  horizontal  edges  of  the  domain.  The  nodes  are  free  to  move  vertically 
by  symmetric  boundary  conditions  along  the  left  and  right  edges  of  the 
domain. 


This  problem  can  be  solved  readily  by  perhaps  many  PDE  solution  methods 
whenever  v  assumes  sufficiently  large  values.  For  example,  a  value  of  v  = 
0.02  produces  shock  gradients  on  the  order  of  102. 

The  MFE  method  requires  only  an  8  x  8  grid  to  give  reasonably  accurate 
solutions  to  this  problem,  and  Figures  11  and  12  show  accurate  MFE  solutions 
on  a  12  x  12  grid.  Here  Figure  11  presents  an  isometric  view  of  the  evolving 
profile  of  the  y-component  of  velocity  at  t  =  3.0,  well  after  the  shock  has 
formed  and  after  the  wavefront  has  undergone  significant  shearing.  The  x- 
component  of  velocity  is  sufficiently  sheared  that  a  hidden  line  plot,  which 
is  not  yet  available,  is  required  for  easy  interpretation  by  the  naked  eye. 
The  MFE  grid  nodes  have  migrated  extensively  from  their  initial  positions  as 
can  be  seen  in  Figure  12  which  represents  the  grid  mesh  projected  onto  the 
x-y  plane  at  t  =  3.0.  Figures  13  and  14  present  contour  plots  for  selected 
constant  values  of  u  and  v,  respectively,  at  t  -  3.0.  It  is  evident  from  the 
magnitudes  of  shock  gradients  and  from  the  regions  of  significant  curvature 
which  span  nearly  the  entire  domain  that  an  alternative  PDE  method  with  a 
fixed  grid  may  require  on  the  order  of  104,  or  more,  grid  nodes  in  order  to 
achieve  comparable  degrees  of  accuracy  in  this  problem. 

This  same  basic  problem  can  now  be  made  to  correspond  to  a  much  more 
demanding  physical  problem  by  letting  v  =  0.002,  Figure  15  shows  an  isometric 
view  of  the  MFE  solution  on  a  16  x  16  grid  for  this  case.  Shock  gradients 
are  now  generated  with  magnitudes  of  several  times  10^,  Before  discussing 
these  MFE  results  in  detail,  some  general  observations  should  be  discussed: 
It  is  extremely  unlikely  that  any  existing  PDE  method  using  either  a  fixed 
grid  or  a  less  than  optimal  adaptive  grid  can  accurately  solve  this  test  pro¬ 
blem  with  fewer  than  105-106  grid  nodes.  Grid  aspect  ratios  frequently 
assume  very  large  values  (102-103),  It  should  also  be  noted  here  that 
numerous  inviscid  solvers  which  are  under  development  do  not  apply  at  all  to 
this  type  of  advecti on-diffusion  problem  because  the  Laplacian  is  an  essen¬ 
tial  mathematical  operator  whose  effects  must  be  rigorously  resolved  in 
advecti on-diffusion  PDE's.  Because  inviscid  solvers  do  not  generally  solve 
PDE's  which  contain  Laplacians,  they  generate  shocks  with  gradient  shapes  and 
magnitudes  that  are  governed  exclusively  by  the  grid  spacing  and/or  by  the 
purely  numerical  dissipative  processes  In  the  inviscid  method,  per  se.  Con¬ 
sequently,  inviscid  solvers  have  almost  no  chance  of  resolving  those  physical 
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Figure  11.  Isometric  view  of  v  at  t  =  3,0  in  the  2-D  Burger-like 
example  on  a  12  x  12  MFE  grid. 
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Figure  12.  MFE  grid  projections  on  the  x-y  plane  at  t  «  3.0  in 
the  2-D  Burger-like  example  on  a  12  x  12  grid. 


768 


Figure  13.  Contour  plots  of  selected  values  of  u  at  t 
Burger-like  example  on  a  12  x  12  grid. 
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Figure  14.  Contour  plots  of  selected  values  of  v  at  t  »  3.0  In  the 
2-D  Burger-like  example  on  a  12  x  12  grid. 
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Figure  15.  Isometric  view  of  Burger's  test  example  at  t  =  1,8  with 
v  =  0.002  on  a  16  x  16  MFE  mesh.  Shock  gradients  have 
magnitudes  of  approximately  IQ3  in  this  solution  for  u. 


dissipation  effects  which  are  usually  expressed  by  Laplacian  operators  and  are 
present  with  fundamental  physical  significance  in  transport  theory,  hydrody¬ 
namics,  plasma  physics,  continuum  mechanics,  and  many  other  disciplines  in  the 
physical  sciences.  This  critical  discussion  is  not  intended  to  denigrate  the 
extensive  research  efforts  on  inviscid  PDE  solvers  and/or  fixed  node  POE 
methods  where  they  legitimately  apply;  but  it  does  suggest  that  efforts  to 
accommodate  Laplacian  operators  in  otherwise  inviscid  solution  methods  and 
efforts  to  investigate  more  optimal  adaptive  grid  methods  for  use  in  many 
existing  PDE  methods  which  are  applied  to  advectiondif fusion  problems  should 
now  assume  greatly  increased  significance.  In  the  meantime,  the  MFE  method  is 
providing  various  clues  to  some  of  the  significant  new  areas  where  mathematics 
research  can  profitably  be  intensified,  as  will  be  indicated  below. 
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Early  MFE  results  in  2-D  are  apparently  continuing  the  trend  which 
appeared  in  previous  1-D  results.  There,  we  saw  that  MFE  solutions  of  both 
the  Navier-Stokes  and  physically  dissipative  continuum  mechanics  equations  in 
1-D  exhibited  extremely  high  levels  of  simultaneous  resolution  of  extremely 
disparate  microscale  and  macroscale  physical  processes.  While  current  2-D 
MFE  results  exhibit  similar  promising  features,  numerous  mathematical  prob¬ 
lems  still  require  resolution  in  order  to  attain  fully  the  desired  levels  of 
success  in  truly  large-scale  problems  in  2-D.  Clues  to  these  problem  areas 
can  be  seen  in  Figure  15.*  For  example,  the  irregularity  of  the  grid  tri¬ 
angles  in  the  face  of  the  shock  could  eventually  prove  to  be  troublesome. 
Similarly,  the  small  oscillation  at  the  base  of  the  shock  in  this  run  is 
unsatisfactory,  even  though  It  can  be  eliminated  in  any  number  of  ways. 
Extensive  testing  and  analysis  has  indicated  that  the  causal  mechanisms 
underlying  such  mesh  irregularities  and  oscillations  in  2-D  can  be  associated 
with:  (i)  time  step  and  error  control  policies  in  the  basic  ODE  integrator 
of  Gear  which  is  presently  used,  (ii)  convergence  properties  of  the  linear 
solver,  and  (iii)  limitations  in  the  first- generation  regularization  func¬ 
tions.  Each  of  these  areas  are  worthy  of  continued  Intensive  investigation. 

4.  Conclusions  and  Future  Work. 

•  The  MFE  method  can  now  be  considered  for  large-scale  applications  in  2-D. 
Applications  of  the  2-D  MFE  method  to  airblast  effects  and  structural 
impacts  in  continuum  mechanics  are  in  their  early  developmental  phases. 

•  It  is  apparent  that  extensive  new  research  is  needed  in  numerous  areas  of 
applied  mathematics  in  order  for  the  MFE  and  other  related  adaptive  grid 
PDE  methods  to  reach  their  ultimate  potential.  Some  of  the  key  topical 
areas  for  additional  research  are: 

-  ODE  integrators  for  PDE  applications. 

-  Linear  systems  solvers  for  non- symmetric  matrices  which  are  not  diagon¬ 
ally  dominant.  (Even  the  most  advanced  linear  solvers  for  symmetric, 
diagonally  dominant  matrices  converge  too  slowly  for  use  in  difficult 
advecti on-diffusion  problems.)  We  have  one  new  concept  for  such  highly 
skewed  matrix  systems  under  development  which  is  showing  significant 
promi se . 

-  Boundary  conditions  for  arbitrary  domains. 

-  Alternative  MFE  basis  functions. 

-  Alternative  co-ordinate  systems.  (The  MFE  does  not  suffer  from  singu¬ 
larities  at  the  origin  in  cylindrical  and  spherical  co-ordinates.) 

-  Alternative  regularization  schemes.  (These  powerful  schemes  are  only 
now  starting  to  be  used  in  PDE  solution  methods.) 

The  present  research  has  made  sufficient  early  advances  in  most  of  these 
areas  to  indicate  the  vast  potential  which  lies  ahead  as  new  advances  are 
made. 


*It  is  apparent  that  these  suggested  mathematical  problems  will  have  to  be 
resolved  not  only  for  the  MFE  method  but  also  for  most  other  advanced  PDE 
methods  which  may  seek  to  solve  the  difficult  advectlon-dif fusion  equations 
which  frequently  arise  in  physical  problems. 
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ABSTRACT .  The  formalism  for  a  complete  treatment  of  elastic 
wave  Scattering  from  cylindrical  cavities  and  solid  inclusions  has 
been  worked  out  for  both  perpendicular  and  oblique  indice.  Special 
cases,  e.g.,  fluid  cylinders  enclosed  in  fluid  or  solid  media,  have 
already  been  numerically  analyzed.  Poles  of  scattering  amplitudes 
in  the  complex  frequency  plane  have  been  found,  and  were  physically 
interpreted  in  terms  of  helical  surface  waves  on  the  cylinders 
propagating  in  both  the  interior  and  the  exterior  medium.  Disper¬ 
sion,  attenuation  and  refraction  of  these  surface  waves  have  been 
obtained. 


I.  INTRODUCTION.  We  consider  the  problem  of  scattering  of 
obliquely  incident  elastic  or  acoustic  waves  from  the  surface  of 
an  infinite  circular  cylinder.  There  are  four  cases  of  interest: 

(i)  inside  solid,  outside  solid 

(ii)  inside  solid,  outside  fluid 

(iii)  inside  fluid,  outside  solid 

(iv)  inside  fluid,  outside  fluid 

A  complete  analytical  solution  for  case  (i)  was  first  obtained  by 
White  [l] ,  whose  work  was  later  corrected  by  Lewis  and  Kraft  [2], 

The  second  case  was  discussed  by  Flax,  Varadan  and  Varadan  [3]  , 
and  the  appearance  of  resonance  effects  in  the  numerically  calcu¬ 
lated  scattering  cross  section  was  noted.  Case  (iii)  was  treated 
by  Solomon  et  al  [4  ]  for  normal  incidence.  The  present  authors 
discuss  in  a  recent  paper  [  5  ]  the  refraction  effects  which  take 
place  upon  the  generation  of  surface  waves  in  case  (iv). 

In  the  present  paper  we  develop  a  general  formalism  that 
covers  the  four  cases  mentioned.  This  includes  both  formulas  for 
the  scattering  amplitudes  of  the  resonances  and  an  analysis  of 
their  Watson  poles  in  the  complex  frequency  plane  and  their  physical 
interpretation.  We  introduce  a  compact  notation  and  a  transforma¬ 
tion  gauge  which  simplify  the  solution  considerably,  making  it 
suitable  for  computer  coding. 
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As  in  White  [l]  we  call  1  the  inside  region  and  2  the 
outside  region.  We  also  use  arabic  subscripts  (1,2)  for  longi¬ 
tudinal  or  compressional  waves  (p-waves)  and  roman  subscripts 
(I, II)  for  transverse  or  shear  waves  ( s -waves )  .  Therefore,  in 
general  we  call  <j>2  the  angle  of  the  incident  wave  propagation 

vector  (which  we  assume  to  be  in  the  x-z  plane,  as  in  Fig.  1) 

with  the  x-axis;  we  caU  it,  however,  when  we  want  to  specify 

that  the  incident  wave  is  an  s-wave. 

II.  ELASTIC  WAVE  SCATTERING.  The  equation  of  motion  for  the 
particle  dispTacement  if  in  an  elastic  medium  is 

(X  +  2u)v^  u  -  u^x(vxu)  *  p32U/3t2  (1) 

where  X  and  v  are  the  Lame  constants  of  the  medium  and  P  is  its 
density.  The  particle  displacement  u  is  obtained  from  a  scalar 
(  &  )  and  a  vector  (V)  potential: 

u  =  Vip  +  'vxV  (2) 

where  ^  contributes  to  p-waves  and  V  to  s-waves. 

Inserting  Eq.  (2)  into  Eq.  (1)  leads  to  two  wave  equations 
for  the  potentials  ^  and  V: 

v2*  =  (l/c2)— £ 

*2V  =  0/c|)||i 

where  Cp  *  {(x  +  2y)/p}^2  is  the  speed  of  p-waves  and  Cg  ~  (u/p)^^ 
is  the  speed  of  s-waves  in  the  medium. 

We  define 

Pi  «  ^sin^  {3) 

**1  =  kicos*i  (4) 

where  i=l,2,I,II  and  k^  =  .W;/c^  is  the  wavenumber.  The  angular 

frequency  of  the  wave,  which  we  assume  to  be  monochromatic,  is  w  , 
Snell's  law  requires 

sin  <J>  ^  sin  <t>  2  sin<j>  ^ 

C1  c2  CI 

leading  to 


sin 


II 


'II 
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(5) 


as 


P1  =  p2  “  PI  “  PII  =  p 

A.  P-WAVE  INCIDENCE.  The  incident  p-wave  can  be  expressed 


* 


i  _ 


(l/p2od2)e 


ik2*r 


ipz  iq^rcose 
Ne  e  L 


"  Ne1pznf0  eninJn(q2r)cosn0 
en  =  1  (n=0),  2  (n>0), 

where  we  have  used  the  Jacobi-Anger  formula 

We  now  introduce  the  following  compact  notation: 
N  =  l/p2u2 

cn  =  Neninelpzcos(ne) 
sn  =  Neninelpzsin  (ne) 

=  3c  /  se 


n 

jn  =  j 


n 


n' 
(qr) 


Bessel  function,  q=q.  and 
i=l ,2,1,11  according1^  case 


11  n  "  «n  (*r> 


(6) 


(7) . 

(8) 

(9) 


llankel  function  of  first  kind 
We  implicitly  assume any  time  an  index  n  occurs. 

In  this  notation,  Eq.  (6)  becomes 

li;1  =  J  C  . 
r  n  n 

In  the  case  of  scattered  p-waves  an  expansion  of  J  c  can  be 

n  n 

used,  but  with  coefficients  R  ,T  to  be  determined  through  a 
matching  at  the  boundary  (see  Sec.  D) .  Also  in  region  2,  where 
the  scattered  (reflected)  waves  are  outgoing,  H  must  replace 
J  *  i.e.,  n 


=  T  J  c 
v  n  n  n 


for  transmitted  p-waves  , 


(10) 


-  Rn  «n  °n 


for  reflected  p-waves  . 


775 


B.  INCIDENT  S-WAVES.  The  geometry  in  the  case  of 
incident  s-waves  as  illustrated  in  Pig.  2,  which  shows  that  V 
has  the  components: 

Vr=V ( cos x  sin©  +  sin  X  sin  ♦  cos  ©) 

V  =  V  (cos x  cos©  -  sinX  sin ^  sin©)  (11) 

y 

Vz  -  -V  sin  X  cos  $ 


where 


exp(i£jj*r) 


n  n 


(12) 


since  =  0  we  can  add  to  V  and  fo.  Now 

Vv=iqITcos©  V f-iqXIsin§  V  +  ipVz  (13) 

Therefore  choosing 

$>  =  -  (si  nxsi  n<j>V  )/(iqIX) 

and  adding  $$  to  V,  it  follows: 

Vr  =  Vcos x  sin  © 

V0  -  Vcosx  cos  © 

V_  =  -Vs  in  X  /cos4>  . 

Z 

Equation  15  shows  that  any  randomly  pClarized  incident  s-wave 
can  be  decomposed  as  a  linear  combination  of  a  X  =  0  wave  (called 
SH  or  horizontally  polarized)  and  a  x  =  ir/2  wave  called  SV  or 
vertically  polarized)  with  coefficients  cosx  and  sinx  r 
respectively. 

For  SH-waves  we  have 


(14) 


(15) 


Vr=Vsine  =  -ni  Jnsn/q1]; 

VQ  -  vcose  — i  J'cn/qlx 
vz  =  o 

and  for  SV-waves 

Vr  -  ve  -  0 

Vz  =  -V/fcos*  =-  Jn  Cn/cosd) 


(16) 


(17) 
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C.  SCATTERED  S-WAVES.  The  vector  potential  V  can  be 
expressed  as_^ 

V  =  v$  +  V0X2  +  xi 

where  $,e  and  x  are  solutions  of  the  Helmholts  equation 
(v2  +  k?)$  =0,  i  *  I, II 

In  equation  (18),  ^(3x/3z)  has  been  eliminated,  since  ^*vf  =  0 

for  any  function  f.  Also,  instead  of  k-x  we  have  simply  written 
x.  Now 

v  =  li  +  12®. 
vr  ~  3r  r  36 


-  1 21 
r  39 

=  21  H 

32 


3Q_ 

ar 


Writing 


(18) 

(19) 


(20) 


*  =  (i/qi)AnJnsn 

6  =  (i/qi)AnJnsn 
x  “  cn^nsn 

and  using  recurrence  relations  for  Bessel  functions  we  get  for  the 
components  of  V: 

Vr  =  -Wlsn 


(21) 


VJn+lcn 


(22) 


B  J  s 
n  n  n 


where  A  and  B  are  coefficients  to  be  determined  through  the 
matching  of  the  inside  and  outside  region  at  the  cylinder  surface. 

In  the  case  of  SV-waves  we  put 


*  *  (i/qi)AnJncn 

0  =  (-i/q1)AnJnSn 
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and  arrive  at  Eq.  22  with  sn  and  cr  interchanged.  To  treat 

both  cases  simultaneously  we  use  Eq.  22  with  the  understanding 
that  for  SV-waves  sn  and  cn  must  be  interchanged.  Also,  whenever 

a  difference  of  sign  occurs  we  write  +  or  +  with  the  lower  sign 

applying  to  the  case  of  incident  SV-waves. 


D.  THE  MATCHING.  At  the  surface  of  the  cylinder  r=a,  we 
impose  the  following  six  boundary  conditions : 


ulad  ’  v2°rrj  =  u2arj 

with  j  =  r,  ©,  z;  i.e,,  we  require  the  continuity  for  the  three 
components  of  the  displacement  u  and  stress  tensor  arj  of  the 

incident,  transmitted  and  reflected  waves.  This  gives  a  system 
of  six  equations^  in  the  six  unknowrs  R  ,  T  ,  A  .  B  .  F  ,  G  for 
the  three  cases  of 


(i)  incident  p-wave 

(ii)  incident  SH-wave 

(iii)  incident  SV-wave 


The  system,  in  the  more  general  case  (solid  inside  and  outside) 
is  given  in  Table  1.  In  the  first  six  columns  are  the  coefficients 
of  the  six  unknowns.  In  the  last  three  columns  are  the  inhomo¬ 
geneous  terms  in  the  three  cases.  If  inside  (or  outside)  we  have 
a  fluid,  then  y  =  0,  Eq.  2  and  3  expressing  continuity  of  u@  and 

u  do  not  apply,  and  A  -  =  0  (Frt  =  G„  =  0  for  fluid  outside) . 

z  n  n  n  n 

This  system  of  equations  may  be  solved  in  a  straightforward 
manner,  e.g.  by  using  Cramer's  rule.  In  the  following,  we  shall 
present  numerical  results  for  its  solutions  in  some  particular 
cases . 


III.  RIGID  INCLUSION  IN  ELASTIC  MEDIUM.  We  first  specialize 
the  above  solution  to  the  case  of  normal  incidence  of  p  or  s  waves 
on  a  rigid  cylindrical  inclusion.  The  determinant  of  the  system 
of  Table  1  which,  after  the  case  of  Cramer's  rule,  also  becomes 
the  common  denominator  of  the  coefficients  Rn  through  Gn,  reduces 
in  this  special  case  to 


det 


D 

D 


11 

D12 

31 

D32 

(24) 


where  a  is  the  cylinder  radius,  and 
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D 

D 

D 

D 


11 

12 

31 

32 


k2a  Hn-l(k2a)_nHn  (k2a) 
n  Hn  (kIIa) 

n (n+1)  Hn (k2a) -nk2a  Hn_1 (k2a) 

kIIa  Hn-l*kIIa)  +  [J|s(kIIa)  2'n(n+l)]  Hn(kxia)* 


(25) 


The  poles  of  the  scattering  amplitude  are  given  by  the  zeros  of 
Eg.  (24)  in  the  complex  frequency  plane.  These  were  obtained  by 
us  numerically,  and  are  shown  in  Fig.  3  for  the  case  of  the 
elastic  medium  being  aluminum,  plotted  in  the  complex  plane  of 
the  variable  k2a  .  These  zeros  are  seen  to  have  negative  imaginary 

parts.  Zeros  with  given  value  of  n  (as  indicated)  are  joint  by 
dashed  lines.  However,  as  was  shown  in  connection  with  sound 
scattering  from  a  sphere  [7^,  physical  interpretations  are  obtained 
if  one  considers  the  zeros  to  be  grouped  in  "layers",  connected  by 
solid  lines  in  Fig.  3  and  labeled  by  the  integer  £  as  indicated. 

It  was  shown  in  [7]  that  the  residues  of  the  scattering  amplitude, 
summed  over  the  poles  along  each  solid  line  in  Fig.  3,  synthesize 
a  surface  wave  which  for  the  case  of  a  cylindrical  inclusion, 
propagates  over  the  cylinder  surface  on  a  circular  path  for  normal 
incidence,  or  as  a  helical  path  for  oblique  incidence  [5].  The 
dispersion  curves  of  these  waves  are  now  being  obtained  using 
standard  methods  [8}.  In  addition  to  these  "bulk  type"  (p  or  s) 
surface  waves,  a  "Rayleigh  type"  surface  wave  is  also  found  from 
our  pole  calculations,  similar  to  the  case  of  a  spherical  cavity 
as  discussed  by  Norwood  and  Miklowitz  [9] .  In  addition,  corre¬ 
sponding  poles  and  surface  waves  were  obtained  for  an  empty  cavity 
in  aluminum. 


IV.  FLUID-FILLED  CAVITY  IN  ELASTIC  MEDIUM.  For  the  case  of 
normal  incidence  of  p  or  s  waves  on  a  fluid- filled  cylindrical 
cavity,  the  corresponding  determinant  from  Table  1  is 


where , 


D 

D 


13 

21 


det  =  l 

Dn 

D12 

°13 

D21 

°22 

D23 

(26 

°31 

D32 

0 

in  addition  to 

Eq. 

25, 

*=  n  Jn(k1a)-k1a  Jn_j_(k1a)/ 

=  £  n  (n+1)  -h  (k^a)  2  j  Hn(k2a)-k2a  Hn_1(k2a)  , 


d22  =  n  kna  Hn-l(kIIa)  -n(n+l)Hn(kITa) 
d23  "  Mp-,/p2)  <*!!*>  2  Jn  <*!!*>■ 


The  poles,  obtained  from  the  zeros  of  Eq.  (26)  for  a  water-filled 
cavity  in  Al  ,  were  found  to  be  numerically  close  to  those  for  p,s 
and  Rayleigh-type  surface  waves  of  the  above-mentioned  empty  cavity. 
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In  addition,  pole  layers  were  found  that  lay  very  close  to  the 
real  axis  of  the  complex  k2a  plane;  these  correspond  to  internal 

surface  waves  that  propagate  in  the  filler  fluid.  They  appear 
in  the  form  of  resonances  in  the  scattering  amplitude  when  the 
latter  is  plotted  vs.  frequency,  as  seen  in  Fig.  4.  This  figure 
shows,  for  backscattering ,  the  modulus  of  the  p-*p  scattering 
amplitude  R  for  n  =  0  through  5.  The  resonances  interfere 

destructively  with  the  background  amplitude  Rr  ^  for  ai  empty 

cavity,  and  if  the  latter  background  is  subtracted,  the  resulting 
pure  resonances  of  the  internal  surface  waves  appear  in  Fig.  5. 


The  resonance  frequencies,  which  on  the  k2a  scale  we 
designate  by  x  ,  can  be  read  off  this  figure;  and  provide  us 
with  the  real  parts  of  the  complex- frequency  poles.  The 
imaginary  parts  are  negligible  to  first  order,  indicating  a 
minimal  attenuation  of  these  internal  surface  waves.  Note  that 
the  same  resonance  frequencies  are  obtained  for  p-»p  and  s-*s 
scattering  as  well  as  for  p-»  s  and  s p  scattering  ( "mode  con¬ 
version")  since  physically,  they  all  originate  from  the  eigen- 
frequencies  of  the  internal  fluid. 


The  phase  and  group  velocities  of  the  internal  fluid  can  now 

be  obtained  from  x  as  follows  [4,5*1: 

n&  l  J 


cph  =  (c2/n)xrn  ' 

Cgp  “  c2  [  3(k2a)/  3  n] 


k2a  =  Xnjt 


(28) 

(29) 


In  Fig.  6,  we  show  the  corresponding  phase  velocity  dispersion 
curves  of  the  internal  sufrace  waves  as  obtained  from  Eq.  8. 


These  dispersion  curves  help  in  determining  the  refraction 
angles  that  appear  during  the  generation  of  a  surface  wave  by  an 
externally  incident  plane  wave,  as  outlined  in  Reference  [  5 J  . 
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FIGURE  CAPTIONS 


Fig. 

1. 

Geometry  of  p-wave  incidence  on  a  cylindrical  obstacle 
in  an  elastic  medium. 

Fig. 

2. 

Geometry  of  s-wave  incidence  on  a  cylindrical  obstacle 
in  an  elastic  medium. 

Fig. 

3. 

Poles  of  the  scattering  amplitude  for  a  rigid  cylindrical 
inclusion  in  aluminum,  plotted  in  the  complex  frequency 
plane. 

Fig. 

4. 

Modulus  of  p->p  backscattering  amplitude  vs.  frequency 
for  normal  incidence  on  a  water-filled  cavity  in  aluminum 

Fig. 

5. 

As  in  Fig.  4,  after  subtraction  of  an  empty-cavity  back¬ 
ground  amplitude. 

Fig. 

6. 

Dispersion  curves  of  internal  surface  waves  in  a  water- 
filled  cylindrical  cavity  in  aluminum. 
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STABILITY  OF  PLANE  NEAR-EQUIDIFFUSIONAL  FLAMES 
WITHOUT  INVOKING  THE  CONSTANT-DENSITY  APPROXIMATION 

T.  Jackson  and  A.  Kapila 
Rensselaer  Polytechnic  Institute 
Troy,  New  York  12181 


INTRODUCTION.  Near-equidif f usional  flames  (NEFs)  are  character¬ 
ized  by  near-unity  Lewis  numbers,  near-adiabatic  flame  temperatures 
and  near-uniform  enthalpies.  These  flames  are  special  three-dimensional 
solutions  of  the  combustion  equations  and  provide  a  convenient  framework 
for  the  theoretical  study  of  a  number  of  flame  phenomena.  An  extensive 
discussion  of  NEFs  appears  in  Chapters  2  and  8  of  Buckmaster  and  Ludford 
(1982),  where  a  set  of  reduced  equations  governing  these  flames  has  also 
been  derived.  This  paper  is  concerned  with  a  study  of  the  linearized 
stability  problem  associated  with  the  plane  NEFs. 

GOVERNING  EQUATIONS .  From  the  full  combustion  equations  involving 
Arrhenius  kinetics,  a  set  of  reduced  equations  appropriate  to  the  NEFs 
can  be  derived  in  the  limit  of  large  activation  energy  (e  0)  ,  under 
the  characterizing  assumptions 

l”1  =  1  +  ez/a  ,  H  =  T  +  ctY=l+a  +  Eh  , 

where  a  is  the  heat  of  reaction,  L  the  Lewis  number,  T  the  tempera¬ 
ture,  Y  the  mass  fraction  of  the  reactant  and  H  the  enthalpy.  All 
quantities  are  suitably  nondimensionalized ,  Additional  quantities 
appearing  below  are  the  density  p  ,  the  velocity  y  and  the  pressure 
p  (measuring  deviations  from  the  ambient  pressure  of  unity  on  the 
2 

0 ( M  )  scale,  M  being  a  representative  Mach  number  of  the  flame). 
Lengths  are  measured  on  the  scale  of  the  preheat  zone  and  a  frame  of 
reference  at  rest  in  the  laboratory  is  chosen.  Then,  to  leading  order 
in  £  ,  the  governing  equations  become 

(1) 


(2) 


(3a, b) 

where 


Do  Dv  _  2 

+  =  Q  r  =  T  +  V  v_  , 


DT  2 

p  Dt  ~  V  T  —  0  in  the  unburnt  region, 

T  =  1  +  a  in  the  burnt  region, 


p  —  -  V2h  =  (J£/a  )V2T  ,  pT  =  1  , 
Dt 


D 

Dt 


_3_ 

at 


+  v- 


V 
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and  for  simplicity,  Frandtl  number  is  set  at  unity.  These  equations 
are  supplemented  by  appropriate  boundary  conditions  and  jump  conditions 
across  the  interface  separating  the  burnt  and  unburnt  regions. 

The  solution  of  the  NEF  problem  in  full  generality  is  difficult, 
since  nonlinear  partial  differential  equations  on  either  side  of  a 
possibly  moving  interface  are  involved.  Most  of  the  progress  has  been 
made  on  the  basis  of  the  constant-density  approximation  (CDA)  which 
ignores  density  changes  suffered  by  the  gas  in  passing  through  the  flame, 
by  taking  p  =  1  and  omitting  (3b).  The  basic  system  then  reduces  to 
the  linear  equations  (2)  and  (3a),  in  which  v  is  taken  to  be  a  pre¬ 
scribed  solution  of  equations  (1).  Although  the  approximation  is  drastic, 
it  does  reproduce  qualitatively  several  features  of  real  flames. 

THE  STABILITY  PROBLEM.  Under  the  CDA,  the  linearized  stability 
problem  associated  with  a  plane  flame  leads  to  a  system  of  ordinary 
differential  equations  with  constant  coefficients,  which  can  be  solved 
readily  to  yield  the  relationship  between  the  growth  rate  oj  and  the 
transverse  wave  number  k  .  This  relationship  leads  to  the  dotted-line 
portion  of  the  stability  diagram  of  Fig.  1.  (Details  can  be  seen  in 
Buckmaster  and  Ludford  (1982),  but  the  original  work  is  due  to 
Sivashinsky . )  The  neutral  stability  curves  in  the  Zk-plane  produce  a 
stable  region  in  the  middle,  flanked  on  either  side  by  regions  of 
instability.  Bifurcation  to  cellular  structure  occurs  as  the  left 
stability  boundary  is  crossed,  while  passage  across  the  right  stability 
boundary  yields  a  pulsatile  flame. 

The  CDA  problem  can  be  derived  rigorously  from  the  full  NEF  problem 
as  the  leading- approximation  in  the  limit  a  -►  0  .  The  purpose  of  this 
paper  is  to  analyze  the  stability  of  a  plane  flame  without  involving 
the  a  -*■  0  limit.  After  considerable  manipulation,  the  stability 
problem  can  be  reduced  to  the  equations 


T ' *  -  T'  -  (k2  +  r — 7)T  -  (m  -  k2)aex  _  q 

1  +  aex  ' 

k2  )£ ( 1  +  x ) eX  + 
0  , 


-  h*  -  (k  + 


1  +  ae: 


:)h  +  (m  - 


+  a  (T*  +  — 7* - £ 

a  1  +  aex 


T  +  ame  )  = 


1  -  aex  . .  „  2  . 

m'  '  '  -  - - v  m'  1  -  (k  + 

1  +  aex 


1  +  ae: 


: )m*  - 


aeJ 


1  +  aex  1  +  aex  3 


-  t  k2)  m  + 


2a)0tex 


1  +  aex  (1  +  aex)2  +  3 


1  2 
+  T  k  )  T '  + 
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(1  k2  _  2a2e2x 


(1  +  aex)2  3  (1  +  aex)2 


)  T  - 


1  +  ae* 


P  + 


a>k2aex 


1  +  aex)^  ^ 


P’  -  ^(k2  +  — - -  x-2)T  -  i(l  +  - - “ - “)  T'  - 

3  (1  +  aex)2  3  1  +  aex 


-  (1  +  aex)m"  +  (1  -  |  aex)m'  +  (a)  -  |  aex  +  k2  (1  +  aex))m  + 

+  (- - -  z  +  k2(co  +  T  aex) )  =  0  , 

1  +  aex  3 


subject  to  the  boundary  conditions 


T  =  h  =  p  =  0  ,  m  »  -  w  at  x  =  -  °°  , 

and,  at  x  =  0  , 


T  =  0  ,  T'  -  -  h  =  0  ,  h '  +  —  T '  -  kh  =  0  , 

2  a. 


m  + 


_  SlL  +  MA  ±  *1 - 7{p  -  f  a(m  +  §)}  =  0 


1  +  a  *  A{k(l  +  a)  -  w> 
1+01  X2{k ( 1  +  a)  -  w} 


m  +  r-55—  +  ,k(x2  ~  k2) -  {p  -  f  atm  +  |)}  -  ^  - 


am' 


toT.' 


aa>T 


k2a 


A2  ( 1+a )  X2(l  +  a)2  X2(l  +  a)3  X2(l  +  a) 


=  0  . 


Here, 


X 


=  --  -{1  +  4  k2 
2  2 


4oo  ,1/2 
1+a  ' 


and  primes  denote  differentiation  with  resppect  to  x  .  The  reduction 
of  an  infinite  domain  to  a  semi-infinite  domain  (x  <  0)  is  achieved 
by  being  able  to  solve  the  problem  analytically  in  the  burnt  region 
(x  >  0)  .  The  variable  T,  h,  m  and  p  denote,  respectively, 
the  purturbations  in  temperature,  enthalpy,  mass  flux  and  pressure. 
Solutions  of  the  form 


<p  (x,  Yt  t ) 


(j>  (x) 


^iky-ki)  t 
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have  been  sought,  thereby  limiting  the  perturbations  to  only  two 
spatial  dimensions#  for  simplicity.  Three-dimensional  extension 
can  be  made  at  minor  additional  expense. 

Unlike  the  CDA  problem,  the  non-CDA  equations  have  to  be  treated 
numerically,  and  this  was  done  by  appending  a  root  finder  based  on  Muller's 
procedure  to  the  boundary-value  solver  COLSYS.  Numerical  results  for 
a  =  1  are  shown  in  Fig.  1  by  the  continuous  curves  through  the  actually 
computed  points  (denoted  by  asterisks).  The  right  stability  boundary  does 
not  appear  to  differ  much  from  the  corresponding  CDA  curve.  The  left  sta¬ 
bility  boundary,  on  the  other  hand,  deviates  substantially  from  its  CDA 
counterpart  for  small  k  ,  veering  sharply  to  the  right  to  narrow  the  sta¬ 
bility  region  for  long  waves.  Unfortunately,  the  non-CDA  results  are 
not  yet  available  for  the  approximate  range  5  <  l  <  10,5  ,  as  the  numerics 
breaks  down  in  that  region.  The  corresponding  problem  is  currently  under 
investigation.  For  large  k  ,  an  asymptotic  analysis  shows  that  the  non-CDA 
results  are  the  same  as  the  CDA  results,  to  leading  order.  This  fact  was 
confirmed  by  the  numerics. 

To  summarise,  the  effect  of  accounting  for  variation  in  density  appears 
to  be  most  pronounced  for  long  waves,  whose  region  of  stability  is  substan¬ 
tially  reduced. 
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ABSTRACT.  The  numerical  integration  of  a  nonlinear  singular  integral 
equation  governing  wrinkled  plane  flames  is  reconsidered,  and  the  discussion 
extended  to  its  counterpart  for  freely  expanding  cylindrical  flames.  Straight¬ 
forward  discretization  is  shown  to  give  unsatisfactory  results  and  a  sug¬ 
gestion  is  made  of  how  to  avoid  these  by  subtracting  out  the  principal  part 
of  the  solution  near  its  singularity. 


I.  INTRODUCTION.  The  nonlinear  singular  integral  equation 

TT 

y2(x)  +  ^[cot(^-)  +  cot(^)  -2cot|]y(z)  dz  =  0,  x  e  [0,ir),  (1) 

0 

represents  a  plane  flame  front  which  is  distorted  due  to  thermal  expansion 
of  the  gas  passing  through  it.  The  resulting  profile  of  the  flame  takes 
the  form  of  stationary  cells  which  are  regularly  shaped  and  spaced. 

The  variable  x  in  equation  (1)  is  the  space  coordinate;  the  function 
y(x),  which  is  odd  and  2Tr-periodic,  is  proportional  to  the  slope  of  the 
flame  profile  and  is  singular  at  x  =  tt  according  to: 
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y  ~  2  sgn(x  -  tt)  ln|x-ir|  as  x  +  tt. 


(2) 


This  result  has  been  shown  by  McConnaughey,  Ludford  and  Sivashinsky  [1] 
in  an  earlier  paper  where  equation  (1)  is  also  motivated. 

Equation  (1)  was  solved  numerically  in  reference  [1]  with  relation 
(2)  replaced  by  y  =-2  1n(ir-x)  for  x  less  than  but  near  tt.  The  wrinkled 
flame  front  which  was  calculated  is  shown  in  Figure  1. 

The  present  report  considers  the  more  general  representation  of  (2): 


y  =  2  sgn(x-ir)  1  n | tt  -  x|  +  k 


(3) 


for  x  near  tt,  where  k  is  an  unknown  constant.  It  is  noted  that  in  theory, 
the  value  of  k  should  not  significantly  affect  the  solution  of  (1);  however, 
the  numerical  solution  of  a  discretization  of  (1)  subject  to  (3)  can  be 
sensitive  to  k.  Application  of  a  straightforward  numerical  approach  to  (1) 
and  (3)  for  determining  y(x)  and  k  is  shown  to  yield  unacceptable  results. 

A  more  promising  approach  is  then  suggested. 

An  equation  similar  to  (1)  describing  hydrodynamic  instability  of 
freely  expanding  cylindrical  flames  is  also  discussed. 


II.  NUMERICAL  SOLUTIONS.  For  the  computation  whose  results  are 
reported  in  reference  [1]  and  for  many  of  the  calculations  of  the  present 
study,  the  solution  of  equation  (1)  is  approximated  at  discrete  points 
x.  =  (i-l)h,  with  h  =  xn+-j/N  and  i  =  2,  3,  . . . ,  N  -  1 ,  by  the  solution  of 

the  discretized  form  of  (1)  labeled  below  as  (4b).  To  obtain  this  dis¬ 
cretization: 

1)  The  upper  limit  of  integration  is  replaced  by  x^  =  3.13; 

ii)  the  trapezoidal  rule  is  used  to  approximate  the  ordinary 
integrals; 

iii)  the  principal -value  integrals  are  treated  by  expanding 
y(z)  in  a  Taylor  series  about  the  singular  points  then 
integrating  using  the  trapezoidal  rule  or  the  midpoint 
rule;  and 

iv)  the  derivatives  in  the  Taylor  series  are  approximated  by 
finite  differences. 

Because  y(x)  is  an  odd  function,  identity  (4a)  must  hold.  Finally,  re¬ 
lation  (3)  is  imposed  at  the  last  two  mesh  points.  The  system  to  be 


792 


solved  is  therefore 


=  y(0)  =  0; 


(4a) 


y,Z  =  y2(x.)  =  -  h{(  l  +  i  )[y  cot(-L— L)]  + 

1  1  0=2  j=i+2  0  c 

N  x.  +  x*  x.  y II ,  -i  xM , -  x .  x^+x. 

I  yjEcotl-Jy-L)  -  2  cot  /]  +  -N±l[  cotC-^-g-L)  +  cot(J^)  - 
j  ”  2 


i-2  N 


X.  -  X. 


2  cot-^ii]}  +  (yi.1-y1+1)(2  +  |cot|)+  2y2,  i  =  2 . N-l;  (4b) 


=  -  2  ln(ir-x^)  +  k,  i  =  N,  N  +  l. 


(4c) 


The  unknown  constant  k  in  expression  (4c)  cannot  be  determined 
analytically.  If  x^  and  x^+1  could  be  taken  sufficiently  close  to  it, 

the  value  of  k  would  be  of  little  importance.  The  solution  of  (4)  would 
vary  with  k  only  in  the  neighborhood  of  the  cusp  and  the  actual  flame 
profile  would  not  be  substantially  affected.  This  is  difficult  to 
accomplish  in  practice,  however,  because  of  the  weakness  of  the  logarithmic 
singularity. 

For  our  numerical  calculations,  the  value  of  k  in  (4c)  does  appreciably 
affect  the  solution  of  (4).  Furthermore,  a  multitude  of  values  for  k  are 
seen  to  admit  a  solution.  When  N  =  32,  for  example,  k-values  between  -50 
and  3  give  rise  to  a  solution  of  (4)  and  many  values  outside  of  this  range 
are  expected  to  do  likewise.  However,  the  solutions  corresponding  to  values 
of  k  less  than  -4.3  are  not  physically  acceptable.  (This  critical  number 
varies  with  N.) 

A  "reasonable"  value  for  k  may  be  selected  by  solving  an  appropriately 
modified  version  of  system  (4)  which  includes  k  as  an  unknown  to  be  calcu¬ 
lated.  The  extra  equation  needed  can  be  obtained  by  imposing  the  dis¬ 
cretized  form  of  (1),  in  addition  to  relation  (4c),  at  x...  This  adds  the 
equation  N 

«  N-2  x .  -  x..  N  x .  +  x.,  x . 

yN  =  yj  C0t(  -J-2-^)  +  l  yj[cot(  J  2  N)  -  2  cot  2^]}  - 

2  j-2 

.  2  -  h  »«  ^  ♦  ys.,<2*tco4)  *  2»z.  («) 
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The  numerical  solution  of  (4a -d)  for  several  values  of  N  produces 
the  results  illustrated  in  Figure  2.  It  appears  that  as  N  increases, 
k  may  gradually  tend  toward  some  fixed  value,  but  that  limit  (if  it 
exists)  remains  undetermined  due  to  the  large  N's  required  to  approach 
it*  Also,  the  magnitude  of  k  becomes  comparable  to  the  value  of  y  at 
and  x^+.|  thereby  canceling  the  imposed  asymptotic  behavior  (4c)  since 

k  is  negative.  The  numerical  solution  thus  seems  to  approach  the  zero 
solution  inasmuch  as  is  possible  as  N  increases.  This  trend  is  also 
demonstrated  by  a  flattening  of  the  profile  and  a  decrease  in  the  height 
of  the  cusp. 

Other  approximations  of  (1)  were  also  considered  in  the  present 
study  and  yielded  similar  results.  For  example,  system  (4a  -  d)  was  modi¬ 
fied  to  include  the  contribution  of  z  £  [xN+1  >tt)  to  the  integral  in  (1). 

The  solution  did  not  change  significantly.  Also,  the  problem  was  solved 
when  relation  (4c)  was  imposed  at  the  last  mesh  point  only  and  a  dis¬ 
cretized  form  of  (1)  was  imposed  on  y^  for  1  *  2,  3,  .  N+l.  In  that 

case,  the  value  of  k  obtained  for  each  N  was  somewhat  higher  than  that 
yielded  by  (4a  -  d) ,  nevertheless  the  computed  profile  was  similar  (see 
Figure  3).  In  all  cases  investigated,  the  numerical  solution  exhibits 
the  same  behavior  as  N  increases.  This  behavior  is  contrary  to  what  is 
known  about  the  solution  from  analytical  considerations  (relation  (2)). 
Thus,  these  numerical  results  are  not  acceptable. 


III.  ALTERNATE  APPROACH.  A  better  way  to  solve  equation  (1)  may 
be  to  introduce  and  solve  for  the  smooth  bounded  function  Y(x)  given  by 


Y (x)  =  y(x)  +  2  ln(ir-x)  -  k,  xe  [0,tt]. 


Then  Y(x)  and  k  satisfy  the  equations 


Y (0)  =  2  InTr-  k;  YU)  =  0; 

TT 

[Y(x)  -  2  InU  -  x)  +  k]^  +  |  [cot  (r^r)  +  cot(^-^-)  -  2  cot|*][Y(z)  - 

0 


2  InU  -  z)  +  k]d  z  =  0,  0  <  x  <  ir. 


(5) 


n 

k2  +  |[2  cot  (— -)  -  2  cot|-  -  ]  [Y (z)  -  2  InU  -  z)  +  k]dz  - 

0 


7T  » 

4k  ln*  +  jlTV  YUJdz  +  A+^jiy  *  f  +?TT)1nsds  =  °» 
0  b 
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Figure  2a.  Values  of  k  calculated  for  different  N's.  For  k  =  27,  k  lies  between 
.1  and  .2  as  compared  to  the  zero  value  assumed  in  reference  [1]. 
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A 


a  PROFILE  FOR  (4c)  APPLIED 
TO  XN+|  ONLY}  K  =  2.19  . 

°  PROFILE  FOR  (4c)  APPLIED 
TO  XN  AND  XN+|;  K  =  0.33  . 


igure  3,  Flame  profile  calculated  for  N  =  25  from  the  solution  of 
system  (4a-d)  and  from  solution  of  the  similar  system  in 
which  (4c)  is  imposed  at  only.  The  value  of  k  obtained 

in  the  first  case  is  .33  and  is  2.19  in  the  second  case. 
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where  b  >  1  and 


i4r)lnsds]- 


b 

A  =  4[ln2ir  -  ln2b  +  ^(  ^4y  + 

0 


The  solution  of  this  system  is  presently  under  numerical  investigation. 
For  x  near  t r,  equation  (5)  is  rewritten  as 

[Y(x)  +  k]2  +  4[ln  x+  In  (2tt  -  x)  -  2  In  7T  -  Y(x)]  InU  -  x)  + 


jl(x,z)[Y(z)  -  2  In  (it  -  z)  +  k]dz  -  2k[ln  x  +1  n  (2tt  -x)]  + 
0 


J  (7^7  -  2„-z-x)Y(z>dz  +  A  +  4  f  (jtt  -  f  +  ITT)ln  5  ds  =  °> 

0  b 

where  I  =  cot(-^-^)  -  +  cot(~^-)  +  ~  2  cot  §•  In  this  form, 

large  terms  in  (5)  which  are  potentially  troublesome  have  been  subtracted 
then  added  in  such  a  way  as  to  remove  possible  difficulties.  It  is 
assumed  that  Y(x) ->0  faster  than  1/lnU-x)  as  x  -mt. 


IV.  CYLINDRICAL  FLAMES.  The  equation  analogous  to  (1)  for  the  slope 
of  a  perturbed,  freely  expanding  cylindrical  flame  is 

TT  0 

y2(e)  +  i^[cot(^-|^-)  +  cot(^-)  -  2  cot|0  y($)d  <j>+  c  jy  U)d  <j>  =  0,  (6) 

0  0 


where  e  is  a  suitably  scaled  polar  angle  and  c  is  a  physical  parameter  for 
which  a  typical  value  is  near  16.  The  singular  behavior  of  y ( e )  at  e  =  it 
is  equivalent  to  that  given  in  (2).  Note  that  for  c  =  0,  equation  (6)  is 
the  same  as  equation  (1), 


The  numerical  approach  that  was 
(6).  However,  only  for  small  values 
realistic  values  of  c,  no  meaningful 


applied  to  (1)  has  also  been  applied  to 
of  c  has  a  solution  been  found.  For 
results  have  yet  been  obtained. 
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V.  CONCLUSION.  The  value  of  the  constant  k  which  makes  the 
asymptotic  statement 


y  -  2  sgn(x-ir)  ln|x-ir|  +  k  as  x  +  tt 


consistent  with  equation  (1)  or  equation  (6)  is  seen  to  have  importance 
in  numerical  calculations  of  the  solution  of  those  equations.  Attempts 
to  find  k  by  a  simple  and  straightforward  modification  of  the  numerical 
approach  in  reference  [1]  (where  k  is  assumed  to  be  zero)  are  shown  to 
fail.  Another  approach,  which  changes  equation  (1)  to  a  problem  whose 
solution  has  no  singularities,  is  suggested  but  not  pursued  here. 
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ABSTRACT .  The  moving  boundary  problem  that  arises  from 
a  certain  model  of  one-dimensional  detonation  waves  is 
considered.  Burgers  equations  for  a  function  f  have  to  be 
solved  on  the  two  sides  of  a  discontinuity,  at  which  f  and 
the  jump  in  its  derivative  (corresponding  to  the  exothermic 
reaction)  are  prescribed.  The  steady  solution,  a  summary  of 
its  linear  stability  characteristics,  and  some  preliminary 
numerical  results  are  presented. 


I .  INTRODUCTION .  We  consider  a  simple  model  of  plane 
detonation  waves  in  which  all  the  reaction  is  supposed  to 
occur  at  a  single  location  and  at  a  prescribed  (ignition) 
temperature.  For  combustion  waves  that  travel  at  finite 
Mach  numbers,  the  model  implies  very  fast  kinetics.  A 
complete  presentation  of  the  flame-sheet  theory  on  which  the 
present  discussion  rests  can  be  found  in  Stewart  and  Ludford 

u). 


Substantial  simplification  of  the  mathematical  problem 
occurs  when  the  heat  released  by  the  combustion  is  small,  as 
was  seen  last  year  when  we  presented  the  corresponding  (but 
fundamentally  different)  results  for  near  Chapman-Jouget 
deflagrations  ( 2 ).  Small  heat  release  leads  to  a  gasdynamic 
state  close  to  the  constant  state  ahead  of  the  wave.  Thus, 
the  temperature  and  pressure  are,  respectively, 

i  +  e % (y  -  i)«  +  o(b)  ,  i  +  e\f  +  o(b)  ,  (i) 

and  the  velocity  of  the  detonation  wave  is 

-i  -  e^c  .  (2) 


*Work  supported  by  the  U.  S.  Army  Research  Office. 
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Here  Y  is  the  ratio  of  specific  heats,  3  is  the  (dimension¬ 
less)  heat  released,  c  measures  the  closeness  of  the  wave 
velocity  to  the  undisturbed  sound  speed  (-1) ,  and  f  represents 
the  variation  of  the  gasdynamics  state  from  the  undistrubed 
state.  Following  l 2)  we  will  use  n  for  distance  in  a  frame 
moving  with  the  wave  and  T  for  time. 

The  function  f(n,T)  satisfies 


+  lii  C  k  (T) 


(3) 


f(0,T)=f*,  ^(O+.T) 


fn(0-,T)  =  -Yo/y  =  -  (y+l)a2/2y ,  (4) 


f  (-»,T)  =  0  ,  f  (+°°,T)  <  “ 


(5) 


where 


k  =  2c/(Y+l) 


(6) 


will  also  be  called  the  velocity  of  the  wave.  Here  n  =  0  is 
the  location  of  the  flame  sheet  (where  the  temperature  is 
constant) ,  f*  is  a  reactivity  parameter  fixing  the  flame 
temperature,  and  Y  is  the  mass  fraction  of  the  deficient 
reactant.  If  c(T)  °were  known,  only  four  boundary  conditions 
would  be  required  to  determine  f  .  However,  there  are  five 
because  the  condition  (4a)  counts  twice;  so  that  c  is 
determined  along  with  f  . 


II.  STEADY  SOLUTIONS  AND  THEIR  STABILITY »  From  the  Rankine- 
Hugoniot  jump  conditions  it  can  be  shown  that  there  is  a  minimum 
steady  detonation  velocity,  the  corresponding  wave  being  know  as 
a  Chapman- Jouget  detonation,  here  given  by  k  =  a  .  for  every 
k  >  a  and  f*  [f  ,2k]  ,  there  is  a  steady  solution 

C  £ 


where 


f±-k±/  k2-a 2  ,  = (Y+l) kn/Y  ,  C+- (Y+l)  (f+-f_) n/2y  .  (8) 


The  solutions  corresponding  to  f*  =  f_  and  f*  -*•  2k  have 
special  significance  in  detonation  theory.  For  f*  *  f_  ,  the 
weak  detonation  is  obtained,  the  velocity  k  being  uniquely 
determined  by  the  reactivity  parameter  f  +  .  As  f*  **  2k  ,  the 
ZND  detonation  is  obtained,  i.e.  a  shock  with  velocity  k 
followed  by  a  deflagration  adjusting  f  from  2k  to  f+  , 

The  linear  stability  of  a  steady  solution  can  be  examined  by 
taking  the  infinitesimal  perturbations  of  f  and  c  (or  k) 
proportional  to  exp (IT)  .  Stability,  which  is  determined  by  the 
sign  of  Re (X )  ,  is  found  to  depend  on  the  value  of  f*  .  (When 
this  mode  analysis  fails  to  find  instability,  solution  of  the 
corresponding  initial-value  problem  by  Laplace  transformation 
confirms  that  there  is  indeed  stability,  though  for  Chapman- 
Jouget  detonations  it  is  neutral.)  The  steady  detonation  is 
found  to  be 


Pig.  1.  The  f*  ,  c-plane  for  y  =  1.4  ,  Y  =  1  ,  showing  the 
region  where  the  steady  solution  exists  ana  its  stability. 

S  =  stable  ,  u  =  unstable  ;  the  CJ-solutions  are  marginally 
stable. 


(9) 


stable  for 


f_  <  f*  < 


a2/2  , 


unstable  for 


k  + 


/ 


kz  -  a/2  <  f*  <  2k 


(10) 


Figure  1  shows  the  region  of  the  f*  ,  k-plane  where  the  steady 
solution  exists  and  its  stability. 

Note  that 


f*  =  k  +  /  k2  -  a/2 


(ID 


is  not  a  conventional  stability  boundary:  the  single  (real) 
eigenvalue  by  which  instability  is  characterized  tends  to  +»  as 
the  boundary  is  approached. 


1II«  NUMERICAL  RESULTS.  We  now  discuss  the  numerical 
solution  of  the  problem  (3-5)  under  various  initial  conditions. 

A  description  of  the  implicit  numerical  scheme  used  can  be  found 
in  [  l) ;  a  spatial  step  of  0.1,  a  time  step  of  0.05,  and 
boundaries  at  n=±10  were  used  for  most  of  the  calculations 
shown. 

A  numerical  test  of  the  theoretical  stability  boundary  (11) 
leads  to  Figure  2.  The  initial  data  correspond  to  the  steady 
solution  for  y  -  1.4  ,  Y0  =  1  (used  in  all  the  computations 
shown) ,  and  c  *  2  ;  the  disturbance  corresponds  to  the 
discretization  error.  The  c(T)  profile  was  computed  along  with 
f  (n ,T)  for  various  values  of  f*  ,  so  as  to  determine  the 
numerical  stability  boundary.  (The  criterion  for  numerical 
stability  was  that  f  and  c  should  tend  to  their  steady-state 
values  in  the  computation  time  allotted.) 

In  Figure  2  the  theoretical  region  of  stability  is  hatched. 
Figure  2a  shows  the  values  of  c  at  T  -  2.5  for  different 
values  of  f*  ;  the  computed  stability  limit  seems  to  occur  at  f* 
=  3.25  ,  while  the  theoretical  value  (11)  is  f*  =  3.20. 

Typically  the  disturbance  of  c  (due  to  discretization  error)  has 
a  maximum  that  grows  as  f*  is  increased  towards  the  stability 
limit;  for  larger  values  of  f*  ,  the  disturbance  does  not  die 
out,  i.e.  no  steady  state  is  achieved.  Details  of  the  c,T- 
profiles  in  the  hatched  region  of  Figure  2b  are  shown  in  Figure 
3, 


Figure  4  shows  the  response  of  the  stable  detonation 
f*  =  1  /  c  =  1.2  to  a  larger  disturbance  in  the  burnt  mixture 
(h  >  0)  ,  The  dotted  line  is  the  steady  solution  and,  since  f 
is  proportional  to  the  pressure,  the  initial  data  shown 
correspond  to  a  rarefactive  disturbance. 

Figure  5  shows  the  response  of  the  unstable  detonation 
f*  =  1.8  ,  c  =  ccj  -  1.09545  to  a  similar  rarefaction.  Figure 
6  shows  the  corresponding  response  for  the  detonation  f*  =  1.5  , 
c  =  CCJ  ,  which  is  neutrally  stable  but  close  to  the  theoretical 
stability  limit.  The  numerical  scheme  failed  at  T  =  0.1992  , 
but  a  rudimentary  oscillation  in  c  is  obtained,  with  a  small 
but  nontrivial  change  in  f  . 

Figure  7  shows  the  same  detonation  as  in  Figure  6,  but 
subjected  to  a  stronger  rarefaction.  (These  results  were 
obtained  by  an  explicit  scheme,  with  spatial  step  0.2  and  time 
step  0,025.)  Almost  three  cycles  of  an  apparently  periodic 
oscillation  were  observed  before  the  scheme  failed  (at  X) . 


IV,  CONCLUDING  REMARKS.  The  numerical  calculations 
correlate  reasonably  well  with  the  theoretical  stability  results. 
Certainly  the  large  region  of  stability  is  confirmed,  and 
inaccuracy  in  locating  its  boundary  could  well  be  due  to  the 
discretization  error. 

Of  greatest  interest  is  that  our  results  suggest  the 
possibility  of  relaxation  oscillations  for  our  model.  Such 
oscillations  have  been  observed  in  numerical  simulations  of 
detonations  using  a  system  of  first-order  convective-reactive 
equations  [3),  and  have  been  dubbed  "galloping  detonations." 

We  are  currently  trying  to  improve  our  numerical  scheme  to 
overcome  the  difficulties  that  are  encountered  when  c  is  large. 
This  should  make  the  evidence  for  relaxation  oscillations  more 
convincing . 
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Fig.  6.  Response  of  a  neutrally  stable  CJ-detonation  to  a 
rarefaction  in  the  burnt  region.  Breakdown  at  T  =  0.1992  is  not 
shown  by  f-profile. 


ARE  DETONATIONS  STEADY?* 


A. A,  Oyediran  and  G.S.S.  Ludford 
Department  of  Theoretical  and  Applied  Mechanics 
Cornell  University,  Ithaca,  New  York  14853 


ABSTRACT .  A  generic  problem  in  the  transition  from 
deflagration  to  detonation  is  the  overtaking  of  a  steady 
deflagration  wave  by  a  steady  shock  wave.  Such  a  collision 
produces  a  detonation  wave  (as  well  as  a  contact  discontinuity,  a 
back  shock,  and  sometimes  a  rarefaction  wave).  Work  reported  at 
the  27th  Conference  of  Army  Mathematicians  showed  that,  for  small 
heat  release  in  the  deflagration,  the  detonation  wave  cannot  be 
steady.  Here  we  remove  the  restriction  to  small  heat  release  and 
show  that  the  detonation  wave  can  never  be  steady. 


I .  INTRODUCTION .  In  a  paper  presented  at  the  27th 
Conference  of  Army  Mathematicians,  Ludford  and  Stewart  (l) 
considered  the  shock-induced  transition  from  deflagration  to 
detonation  illustrated  in  Figures  1  and  2.  In  particular,  they 
showed  that,  for  small  heat  release,  the  resulting  detonation 
wave  cannot  be  steady.  The  object  of  this  paper  is  to  remove  the 
restriction  to  small  heat  release:  the  detonation  wave  that 
results  from  a  steady  shock  overtaking  a  steady  deflagration  wave 
can  never  be  steady,  according  to  the  ignition-temperature  theory 
of  deflagrations  and  detonations  developed  in  (2)  and  (3J. 


II.  RESULTS  FROM  THE  THEORY.  The  reactivity  of  the  fresh 
mixture  ahead  of  both  deflagration  and  detonation  is 
characterized  by  an  ignition  temperature  T*  .  Stewart  &  Ludford 
[2]  then  show  that  a  structure  exists  for  the  steady  deflagration 
only  if  the  ignition  temperature  lies  in  a  certain  interval 

T*(B)  <  T*  <  T*(B)  -  1  +  B  ;  (l) 


*Work  supported  by  the  U.S.  Army  Research  Office 
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Figure  1.  Outcome  of  a  sufficiently  strong  steady  shock 
mbw  overtaking  a  steady  deflagration  .  The 

lines  and  —  —  represents  detonation  and 

contact  discontiniui ty ,  respectively. 


Figure  2.  Modification  of  Figure  1  for  weaker  shock  strengths. 
The  detonation  is  now  Chapman-Jouget  with  a  centered 
rarefaction  lying  immediately  behind. 
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here  B  is  the  heat  released  by  the  reaction  and  the  temperature 
of  the  fresh  mixture  has  been  used  to  non-dimensionalize .  The 
right  limit  is  the  so-called  adiabatic  flame  temperature,  when  no 
heat  is  used  to  increase  the  kinetic  energy;  the  left  limit, 
which  will  not  be  used  in  our  discussion,  must  be  obtained  by 
numerical  integration. 

Given  the  Mach  number  M  of  a  detonation,  it  is  well  known 
(4)  that  the  reaction  rate  must  lie  within  certain  limits.  For 
the  ignition-temperature  theory,  Lu  &  Ludford  ( 3}  show  that  this 
leads  to  the  requirement 

T*(3,M)  <  T*  <  T*(B,M)  ,  (2) 

where  y- - 

¥*=i+e+  izi  M2(i-v2)  with  v=  inM2V(M2-i)2-2e(Y^)M2  (3) 

2  (y+1)M2 

A 

but  T*  must  be  obtained  by  numerical  integration  (we  shall  not 
need  it) .  Here  y  is  the  specific-heat  ratio  and  we  find 

(1  +  YM2)/(Y  +  1)M2  <  v  <  1  .  (4) 


III.  APPLICATION  TO  THE  PRESENT  PROBLEM.  The  Mach  number 
of  the  detonation  in  either  figure  is  determined  by  the  strength 
of  the  shock  wave.  It  cannot  be  smaller  than  the  Chapman-Jouget 
value,  i.e. 


M, 


CJ 


<  M  <  «  with  =  1+  (y  +1)B +/( l+(y+l)3)  -1  . 


vfj 


(5) 


When  the  shock  is  sufficiently  strong,  as  in  Figure  1,  the  Mach 
number  is  greater  than  MCJ  ;  otherwise  it  equals  Mri  ,  as  in 
Figure  2.  We  shall  now  snow  that,  whatever  the  Mactinumber ,  the 
intervals  (1)  and  (2)  do  not  overlap;  specifically 


¥*(S,M)  >  for  all  Me  (M  ,«)  , 


CJ‘ 

which  is  clearly  ensured  if  the  following  inequalities  hold: 

a 


T*(B,MCJ)  >  T*(B,«o)  >  T*  (B  )  , 

dT*/dM  <  0  for  all  M  e  (MCJ,«)  . 


(6) 

(7) 

(8) 
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Proof  of  the  inequalities  (7)  follows  directly  from  the 
definition  (3);  thus,  _ 

T*(B,Mcj)  =  1  +  yB  +  (Y-l){/fl  +  (y+1)B)2-1  -  B}/(y+1) 
>  1  +  y6  -  T,(B,»)  >  1  +  B  “  T*(B)  . 


The  definition  also  gives 

dT*/dM  =  (y-l)M(l  -  v2  -  2vM2dv/dM2] 

=  -(y-I)M(I-v) ( (1+yM2)+(1-yM2)v]/2( (y+1)M2v- (1+yM2) )  , 


where  we  have  used  the  quadratic  equation  satisfied  by  v  rather 
than  its  solution  (3b) .  The  bounds  (4)  on  v  now  give  the 
inequality  (8) . 
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ABSTRACT  *  During  the  heat  treatment  of  components,  the  transient  thermal 
stresses  can  be  very  high.  This  is  especially  true  if  a  severe  quench  is 
required  such  as  the  quenching  of  steel  gun  tubes  for  the  development  of  a 
martensitic  grain  structure.  In  addition  to  the  large  transient  thermal 
stresses,  severe  transformation  stresses  also  exist  due  to  the  structural 
volume  change  involved.  If  these  stresses  are  high  enough,  inelastic  response 
of  the  material  must  be  considered  and  residual  stresses  will  exist  In  the 
structure  when  the  process  is  complete.  In  this  paper,  both  thermal  and 
transformation  stresses  are  computed  for  various  quenching  procedures  using  a 
hollow  tube  for  the  geometric  model.  The  relative  severity  of  the  thermal  and 
transformation  stresses  and  the  conditions  under  which  they  occur  are 
discussed,  A  general  purpose  finite  element  code,  ADINA,  Is  used  for  the 
computation, 

I ■  INTRODUCTION ,  Rapid  quenching  of  hollow  steel  tubes  from  Initially 
high  temperatures  is  usually  undertaken  to  produce  desired  structural 
properties  in  the  tubular  components.  This  technique  has  been  utilized  in  the 
fabrication  of  large  caliber,  long  gun  tubes.  The  Initial  high  temperature  of 
843°C  (1550°F)  giving  the  steel  gun  tube  an  austentic  grain  structure  is 
cooled  to  less  than  100°C  (212°E)  in  just  a  few  minutes  using  various  water 
spray  methods.  The  desired  end  result  is  a  martensitic  structure  In  the 
material . 

Due  to  the  transient  temperatures  created  by  the  water  spray,  the  large 
thermal  gradients  in  the  tube  wall  give  use  to  thermal  stresses  which  change 
rapidly  in  time.  In  addition  to  these  stresses,  as  the  material  grain 
structure  transforms  Into  martensite,  a  volume  expansion  of  the  structure 
occurs.  This  expansion  associated  with  the  martensite  transformation  gives 
rise  to  additional  transient  stresses,  called  transformation  stresses.  If  the 
stresses  that  occur  during  this  procedure  are  below  the  yield  stress  of  the 
material ,  as  the  temperature  equilibrates  to  room  temperature  and  as  the 
transformation  is  completed,  the  quenched  tube  is  stress  free,  i.e.,  no 
residual  stresses  exist.  However,  if  the  stresses  are  such  that  some 
inelastic  deformation  occurs  in  the  tube,  then  there  will  exist  residual 
stresses  in  the  tube  after  it  is  quenched.  The  latter  Is  the  more  likely 
case. 


This  paper  Investigates  the  transient  temperatures  and  stresses  that 
a  steel  gun  tube  experiences  In  such  a  quenching  operation.  The  material 
properties  and  system  parameters  used  in  the  computations  are  chosen  using  the 
rotary  forge  quench  facility  at  Watervliet  Arsenal  as  a  model.  Specifically, 
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a  memorandum  [1]  describing  the  results  of  an  experimental  quench  test  was 
used  to  generate  the  approximate  convection  heat  transfer  coefficients  for  the 
study.  Most  of  the  interest  is  centered  on  the  breech  (or  rearward)  end  of 
the  tube  and  the  muzzle  (or  forward)  end  of  the  tube,  with  emphasis  on  the 
muzzle  end.  This  is  because  quench  cracks  are  more  frequent  at  that  location. 
In  addition  to  considering  the  two  locations  along  the  tube,  the  effect  of 
altering  the  quench  cycle  is  also  considered.  Nozzles  spray  water  on  the 
outer  diameter  of  the  tube  as  it  is  slowly  rotated.  Sprays  on  the  breech  and 
muzzle  end  can  be  separately  controlled.  The  quench  on  the  bore  or  inside 
diameter  of  the  tube  is  accomplished  by  flushing  the  bore  with  water.  It  is 
of  interest  to  know  the  effect  of  this  flush  when  it  occurs  concurrent  with 
the  water  spray  on  the  outside  diameter,  or  when  it  begins  slightly  before  or 
after  the  OD  quench. 

To  compute  the  temperatures  and  stresses  mentioned  in  the  above 
paragraph,  a  general  purpose  finite  element  computer  program,  ADINA  [2],  is 
used.  The  problem  was  treated  as  an  axi symmetric  plane  strain  in  two 
dimensions.  Separate  analyses  or  runs  are  undertaken  for  the  breech  and 
muzzle  sections  using  their  respective  geometries.  Any  tapering  of  the  gun 
tube  is  ignored  as  are  end  effects.  Although  the  geometry  is  simple,  the 
problem  is  complicated  due  to  the  highly  transient  nature  of  the  problem  and 
the  consideration  of  temperature  dependent  material  properties  and  yield 
strength.  The  properties  used  in  the  report  for  the  computations  are  stated 
in  the  Appendix. 

II .  PROBLEM  STATEMENT .  Thermal  and  transformation  stresses  are  computed 
for  long  hollow  cylinders  subjected  to  different  quench  cycles.  Two 
geometries ,  representing  the  breech  and  muzzle  sections  of  a  gun  tube ,  are 
modeled.  Effects  of  altering  the  quench  cycle  of  the  bore  are  considered. 

III.  FINITE  ELEMENT  PROGRAM.  The  finite  element  geometry  for  the 
problem  is  shown  in  Figure  X  along  with  a  simplified  drawing  of  a  gun  tube. 
Eight  node  quadrilateral  elements  were  used  for  the  model.  Sixteen  elements 
are  used  at  the  muzzle  and  twenty  at  the  breech.  Some  preliminary  work  was 
done  with  other  models  (4  node  quadrilaterals  and  10  element,  8  node  models) 
and  it  was  decided  that  the  current  model  gave  sufficient  accuracy  and 
smoothness  of  results. 

The  finite  element  program  actually  consists  of  two  parts,  one  for 
computing  temperatures,  ADINAT,  and  one  for  computing  stresses,  ADINA.  Each 
program  can  stand  alone,  but  when  one  wishes  to  compute  thermal  stresses  using 
the  same  geometry,  ADINAT  produces  a  file  which  includes  the  temperatures  at 
the  nodes  for  each  time  step  if  the  problem  is  a  transient  one.  This  ADINAT 
output  file  can  then  be  used  as  input  to  ADINA  for  the  stress  computation. 

In  the  program,  the  thermo -physical  properties  were  considered  as 
functions  of  temperature  and  the  values  are  recorded  in  the  Appendix.  The 
convection  losses  during  the  heat  transfer  portion  of  the  computation  are 
considered  to  be  due  to  the  temperature  difference  between  the  tube  wall  and 
ambient,  which  is  taken  to  be  18.3°C  (65°F).  For  the  computation  of  stresses, 
one  has  a  choice  of  several  material  behavior  models  in  ADINA.  The  one  chosen 
for  this  work  was  Model  10  [3],  which  is  applicable  to  the  thermo -el as tic- 
plastic  solution  of  interest.  The  yield  criterion  assumed  is  the  distortion 
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energy  criterion  and  the  yield  stress  is  assumed  to  be  a  function  of 
temperature.  No  creep  or  hardening  is  assumed  although  the  model  allows  both 
to  be  incorporated* 

To  compute  thermal  stresses,  the  problem  for  the  transient  temperatures 
is  just  solved  using  ADINAT  as  indicated  above*  The  special  file  created  by 
ADINAT  is  then  used  as  input  to  ADINA  to  compute  the  thermal  stresses. 

However,  in  many  cases  in  solving  the  temperature  problem,  short  time 
increments  are  used  during  periods  of  large  transients  and  longer  time 
increments  when  the  temperature  gradients  are  not  as  severe.  While  ADINAT 
allows  one  to  change  time  increments,  ADINA  does  not.  This  difficulty  was 
overcome  by  manipulating  the  file  used  for  stress  computation  so  that  with  the 
restart  capability,  ADINA  wauld  see  only  one  time  increment  during  any  one 
computation  interval.  The  temperature  file  could  also  be  manipulated  to 
decrease  the  temperature  difference  between  time  steps  with  the  time  increment 
appropriately  changed.  Finally,  the  restart  facility  in  ADINA  [4]  was  altered 
so  that  a  restart  could  be  undertaken  from  any  previous  time  instead  of  just 
the  last  completed  step.  Simple  linear  interpolation  is  used  in  any  file 
manipulation. 

The  computation  of  transformation  stresses  and  combined  thermal  and 
transformation  stresses  can  be  treated  like  thermal  stresses  with  additional 
effort.  The  effect  of  the  transformation,  at  least  the  aspect  of  it  giving 
rise  to  stresses,  is  to  create  a  volume  change  in  the  material.  In  this  case 
the  volume  change  is  an  increase  and  it  occurs  when  the  temperature  at  a  point 
in  the  material  becomes  equal  to  the  martensite  start  (Mg)  temperature  and  is 
completed  when  it  reaches  the  martensite  finish  (Mp)  temperature.  The  volume 
change  due  to  the  martensite  transformation  is  an  expansion  of  about  3  to  4 
percent.  If  the  transformation  is  assumed  to  be  isotropic,  this  represents  a 
lineal  expansion  of  1  to  1-1/3  percent.  The  form  of  the  terra  giving  rise  to 
thermal  stresses  is  [3] 


alT~TREF]E  (1) 

where:  a  coefficient  of  linear  expansion 

T  temperature 

Tref  a  reference  temperature  for  zero  strains 

E  Modulus  of  Elasticity 

Equation  (1)  can  also  be  used  to  compute  the  stresses  due  to  the 
transformation  if  a  Is  modified. 

Suppose  a  Is  the  modified  coefficient  for  the  transformation  stress 
computation.  Then  a  is  zero  until  Mg  is  reached.  When  the  transformation  is 
just  complete,  the  temperature  is  Mp  and  a  change  in  length  per  unit  length, 
strain  e  ■  A £/£,  is  one-third  the  volume  change  or  .0133.  Between  Mg  and  Mf, 
the  transformation  is  assumed  to  progress  linearly.  Once  Mp  is  reached ,  the 
expansion  is  assumed  to  be  permanent.  Thus,  at  Mg  =*  325°C  (617°F)  [5],  o  =  0 
and  at  Mp  ~  260°C  (500°F)  [5],  aT  =  .0133.  Thus  assuming  Tref  -  0  in  this 
case,  ot  ■  2.666  x  10~5.  Finally,  as  the  temperature  cools  below  Mp,  o'  must  be 
adjusted  so  that  aT  remains  constant  and  equal  to  ,0133,  i.e.,  the  linear 
strain  remains  constant.  If  there  was  no  inelastic  deformation,  when  the 
transformation  was  complete,  the  transformation  stresses  should  vanish  as  the 
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effect  of  the  transformation  *»uld  be  that  of  a  uniform  expansion*  A  plot  of 
a  vs,  temperature  is  shown  in  Figure  2. 

To  show  the  desired  response,  a  problem  was  run  for  transformation 
stresses  alone,  with  a  replacing  the  coefficient  of  thermal  expansion. 

Plastic  response  was  suppressed  by  using  an  artifically  high  yield  strength. 
The  resulting  stresses  after  the  transformation  was  completed  approached  zero 
as  they  should  have  (due  to  the  final  uniform  expansion) ,  Because  only  a 
piecewise  linear  approximation  to  oc  is  allowed  (16  points),  stresses  did  not 
completely  vanish,  but  if  the  16  points  are  grouped  immediately  following  the 
transformation,  the  results  at  that  time  are  improved,  i.e.,  nearer  zero 
stress.  At  later  times  (and  cooler  temperatures)  where  the  approximation  was 
now  cruder,  the  results  worsened.  This  indicated  that  the  assumptions  made  in 
computing  a  were  correct. 

In  a  similar  manner,  a  coefficient,  a*,  can  be  constructed  so  that  the 
combined  thermal  and  transformation  stresses  could  be  computed.  This  was  done 
by  solving  for  a*  from 


a*(T-TREF)E  *  aTE  +  a(T-TRgF)E 

The  quantities  have  been  previously  defined.  Thus,  the  required  coefficients 
are  found  from 

T 

ci*(T)  -  a(T)  +  a(T) - 

T-Tref 

This  is  shown  in  Figure  2  and  in  the  Appendix,  Computations  were  made  with 
each  of  the  three  coefficients.  Plastic  response  was  again  suppressed.  Table 
1  shows  some  of  the  results.  Summing  the  results  of  the  separate  computations 
for  thermal  stress  (a)  and  transformation  stress  (a)  should  give  the  results 
for  the  combined  solution  (a*).  While  this  is  done  elastically,  similar 
behavior  should  be  expected  in  an  elastic-plastic  solution  as  computed  as  a* 
for  a  new  "thermal  load"  increment  would  not  be  affected  by  the  type  of 
solution. 


TABLE  1.  RESULTS  USING  COEFFICIENTS  FOR  COMBINED 
THERMAL  AND  TRANSFORMATION  STRESS 


Node 

Location 

Thermal 

Stress 

(1) 

Trans formation 
Stress 
(2) 

Combined 

Stress 

(3) 

(1)  +  (2) 

%  Diff 

87,75 

1,1 

15,444 

2,133 

17,184 

17,577 

2.3 

7,1 

-39,681 

1,879 

-38,165 

-37,802 

1,0 

16,1 

85,102 

1,600 

86,436 

86,702 

.3 

107.75 

1,1 

8,472 

74,911 

32,132 

83,383 

1.5 

7,1 

-31,339 

65,996 

33,538 

34,657 

3.3 

16,1 

62,566 

-504,735 

-442,237 

-442,169 

.01 

NOTE:  Plastic  behavior  is  suppressed. 


320 


IV.  RESULTS.  Figure  3  is  typical  of  the  type  of  information  that  can 
result  from  a  study  of  this  type.  It  shows  the  response  of  the  breech  end  of 
the  tube  versus  temperature  on  a  Time-Temperature-Transformation  Diagram.  The 
word  typical  is  used  above  because  the  location  of  the  balnite  start  curve  is 
not  known  for  the  actual  material  modeled.  The  bainite  start  curve  shown  is 
for  a  steel  having  the  composition  shown  in  Table  2.  Current  gun  steel 
composites  contain  alloying  elements,  such  as  vanadium,  which  push  the  bainite 
curve  to  the  right.  The  transient  cooling  curves  superimposed  on  the  figure 
represent  the  cooling  of  the  breech  end  of  a  105  ram  M68  tube.  The  quenching 
of  the  outer  diameter  was  delayed  30  seconds  relative  to  the  bore  quench.  The 
convective  heat  transfer  coefficients  were  found  by  assuming  the  OD 
temperature  of  the  breech  reached  ~  200° F  in  eight  minutes.  Assuming  the  bore 
quench  to  be  less  efficient  and  since  no  experimental  temperatures  were 
available  as  on  the  outside  diameter,  the  convective  heat  transfer  coefficient 
was  taken  to  be  one-half  that  of  the  OD.  Figure  4  shows  the  temperature 
distribution  throughout  the  wall  at  various  times  for  the  same  geometry  and 
quench.  At  the  first  time  (2.5  seconds)  one  can  see  the  bore  beginning  to 
cool  and  with  no  change  on  the  outer  diameter.  The  second  curve  shown  at  32.5 
seconds  occurs  just  after  the  OD  quench  begins,  and  the  subsequent  temperature 
distributions  show  that  the  OD  is  cooled  more  rapidly  than  the  bore.  As  the 
quench  approaches  the  end  of  its  cycle,  the  temperature  distribution 
throughout  the  wall  does  not  vary  much.  The  temperature  distributions  vary 
even  less  in  the  muzzle  section,  e.g..  Figure  8, 

TABLE  2.  COMPOSITION  OF  STEEL  FOR  BAINITE  CURVE  SHOWN 


c 

.31 

Ni 

5.07 

Mn 

.76 

Cr 

1.22 

Si 

,30 

1*> 

,48 

P 

.009 

A1 

.031 

S 

.023 

Figure  5  shows  the  variation  of  stress  with  time  for  a  material  element 
on  the  bore  surface  and  on  the  outer  surface.  The  highly  transient  nature  and 
the  severity  of  the  stresses  can  easily  be  seen.  Most  points  can  be  easily 
explained.  Point  1  indicated  the  beginning  of  the  quenching  of  the  outer 
diameter  at  30  seconds  into  the  quench  cycle.  Only  thermal  stresses  exist 
until  point  2  is  reached  at  about  210  seconds  when  the  outer  diameter  begins 
to  transform  into  martensite.  At  point  3,  plastic  deformation  begins  on  the 
outer  surface.  At  point  4,  the  bore  begins  its  transformation.  The  bore 
develops  plastic  deformation  at  about  290  seconds  and  the  bore  transformation 
ends  at  point  6.  The  stresses  beyond  350  seconds  on  the  residual  stresses 
that  exist  at  those  material  elements  after  room  temperature  is  reached. 
Figures  6  and  7  show  the  stress  distribution  throughout  the  wall  for  two 
different  times.  Figure  6  is  included  to  show  that  the  stress  can  vary  quite 
strongly.  The  time  is  that  when  the  bore  first  develops  inelastic  behavior. 
Figure  7  is  the  residual  stresses  that  would  exist  after  the  temperature 
equilibrates. 
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The  remainder  of  the  results  were  generated  for  the  muzzle  section  as 
this  was  an  area  of  interest  due  to  quench  cracking  under  certain  conditions. 
Four  different  quenches  were  considered: 

1,  No  quench  delay  -  the  bore  quench  and  OD  quench  are  initiated 
simultaneously  (Figures  8  through  11). 

2.  30  second  prebore  quench  -  the  bore  quench  is  initiated  30  seconds 
before  the  OD  quench  (Figures  12  through  15). 

3.  30  second  postbore  quench  -  the  OD  quench  is  initiated  30  seconds 
before  the  bore  quench  (Figures  16  through  19). 

4,  No  bore  quench  -  the  bore  is  not  quenched  in  this  case  (Figures  20 
through  23) . 

Figures  8,  12,  16,  and  20  show  the  temperature  distribution  throughout  the 
wall  for  various  times  throughout  the  respective  quench  cycles.  Except  for 
the  initial  few  curves,  there  does  not  appear  to  be  great  differences  between 
the  quenches.  The  first  two  or  three  temperature  curves  will  indicate  the 
type  of  quenching  undertaken,  i.e.,  whether  the  bore  is  quench  first  or  not  at 
all,  etc.  One  can  easily  see  however,  that  as  time  into  the  quench  cycle 
progresses,  the  temperature  distributions  become  rather  flat.  This  indicates 
that  the  temperature  equilibrates  rapidly  due  to  the  high  conductivity  of  the 
steel.  This  is  especially  true  when  the  bore  is  quenched  as  in  any  of  the 
above  conditions.  For  the  case  when  the  bore  is  not  quenched,  there  is  an 
interesting  occurrence  for  the  parameters  chosen.  While  the  stresses  on  the 
outer  surface  became  plastic  on  all  the  above  runs,  the  bore  saw  plastic 
deformation  only  for  the  case  of  no  bore  quench.  This  will  be  discussed  in 
more  detail  later. 

The  transient  temperatures  versus  time  are  shown  for  each  case  for  the 
bore  area,  the  midsection  or  core,  and  the  outer  diameter  in  Figures  9,  13, 

17,  and  21.  This  type  of  curve  was  discussed  previously.  The  intent  of  the 
quench  is  to  cool  the  material  rapidly  enough  to  pass  before  the  *knee'  of  the 
bainite  curve  in  order  to  form  martensite  structure*  As  mentioned  previously, 
the  bainite  start  curve  shown  is  not  that  for  the  current  material  used,  but 
for  one  with  a  different  alloy  content  [5]. 

The  following  curves  (Figures  10,  14,  18,  22)  show  the  variation  of 
stress  with  time  for  each  of  the  quench  cycles.  The  highly  transient  behavior 
of  the  problem  becomes  very  visible.  For  example,  in  the  30  second  prebore 
quench  case  (Figure  14)  at  30  seconds  into  the  quench  cycle,  the  OD  quench 
begins  and  the  circumferential  stress  on  the  OD  becomes  tensile.  At  time  121 
seconds,  the  martensitic  transformation  begins  on  the  outer  diameter.  This 
material  area  tries  to  expand  but  it  constrained  by  the  as  yet  untransformed 
material,  hence  it  is  put  into  compression.  At  about  125  seconds,  the 
material  on  the  0D  had  yielded.  The  bore  transformation  then  begins  at  about 
133-134  seconds  causing  an  abrupt  change  in  stress  at  that  point.  Finally, 
the  transformation  of  the  OD  material  is  completed  followed  by  the 
transformation  of  the  bore  material.  The  material  on  the  OD  is  subjected  to 
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plastic  deformation  (in  tension)  just  before  completion  of  the  transformation* 
The  bore  material  saw  no  plastic  deformation  at  any  time.  Again,  most  of 
these  figures  are  similar  with  the  exception  of  Figure  22  for  the  case  of  zero 
bore  quench.  Here  one  can  see  that  the  stresses  at  the  bore  are  larger  and 
compressive  much  of  the  time* 

The  final  set  of  figures.  Figures  11,  15,  19,  and  23,  show  the  resulting 
residual  stresses  for  each  quench  cycle.  Again,  these  all  have  a  similar 
shape  with  the  exception  of  Figure  23  for  the  case  where  the  bore  was  not 
quenched.  As  mentioned  in  the  above  paragraph,  the  bore  developed  plastic 
deformation  only  for  this  case.  Two  companies  have  provided  steels  with 
similar  chemistry  (or  alloy  content)  for  production  runs  of  gun  tubes.  Tubes 
forged  from  one  of  the  steels  had  a  much  higher  frequency  of  quench  cracking 
until  the  more  severe  no  bore  quench  cycle  was  used.  While  no  direct  effort 
has  been  made  to  verify  this  at  this  time,  the  development  of  quench  cracking 
may  have  been  prevented  by  the  different  stress-time  behavior  and  resulting 
residual  stresses  in  this  case  where  the  bore  area  seems  to  be  under 
compression  for  longer  times. 

V*  FUTURE  WORK.  Concurrent  efforts  in  the  quench  crack  problem  have 
been  undertaken  and  will  be  providing  more  accurate  experimental  data  for 
input  to  this  analysis.  This  refers  to  a  better  estimate  of  the  volume  change 
in  the  temperature,  as  well  as  the  way  that  the  volume  changes  in  time.  An 
up-to-date  bainite  curve  is  being  determined  but  preliminary  indications  are 
that  it  will  not  be  in  any  area  of  the  TTT  diagram  that  would  cause  problems 
if  the  quenching  rates  are  held  the  same.  It  is  hoped  that  more  accurate  heat 
transfer  coefficients  can  also  be  determined.  Incorporating  this  new 
information  in  the  analysis  will  certainly  lead  to  a  better  understanding  of 
the  events  occurring  during  the  quenching  of  long  hollow  tubes. 
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APPENDIX 


MATERIAL  PROPERTIES 

Table  A1  shows  the  mechanical  properties  used  in  the  program  as  a 
function  of  temperature.  As  only  16  points  are  allowed  for  describing  a 
function,  not  all  points  were  used  in  all  calculations.  The  main  source  of 
the  properties  is  noted  in  Reference  6*  The  tables  shown  include  property 
values  at  some  temperatures  which  were  found  by  linear  interpolation* 


TABLE  A1 .  MECHANICAL  PROPERTIES  AS  A  FUNCTION  OF  TEMPERATURE 


Temperature 

(F) 

- - 

Young's  Modulus 
(xlOE6  psi) 

Yield  Stress 
(psi) 

65. 

30. 

160000. 

100. 

29.84 

157  560. 

150. 

29.6 

154080. 

200. 

29.37 

150060. 

225. 

29.26 

148860. 

250. 

29.14 

147120, 

275. 

29.02 

145380. 

300. 

28.9 

143640, 

325. 

28.78 

141910. 

350. 

28.67 

140170. 

375. 

28.56 

138430. 

400, 

28.44 

136680. 

425. 

28.32 

134950. 

450. 

28.20 

133210. 

485, 

28,04 

130770. 

500. 

28.03 

130750. 

560. 

27.67 

126760. 

617. 

27.34 

122980. 

635. 

27.32 

122800, 

1200. 

27.00 

88000, 

1560. 

18.60 

65600. 
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TABLE  A2.  THERMAL  AND  TRANSFORMATION  COEFFICIENTS  USED  IN  THE  STRESS  PROGRAM 


Temperature 

(T) 

Coefficient 
of  Thermal 
Expansion 
(E-6  in./in.-F) 

Coefficient  for 
Transformation 
Computation 
(E-4  in./in.-F) 

Coefficient  for 
Combined 

Stress 

(E-6  in./in.-F) 

65. 

6.3 

2.05 

-2.99 

100. 

6.42 

1.33 

-3.11 

150. 

6.6 

.89 

-3.28 

190. 

6.75 

200. 

6.78 

.66 

-3.47 

225. 

6.84 

.59 

-3,61 

250. 

6.9 

.53 

-3.76 

27  5. 

6.96 

.48 

-3.92 

300. 

7.02 

.44 

-4.08 

325. 

7.08 

.41 

-4.26 

350. 

7.14 

.38 

-4.45 

375, 

7.20 

.36 

-4.65 

400. 

7.27 

.33 

-4.86 

425. 

7.33 

.31 

-5.07 

450. 

7,39 

.30 

-5.31 

485. 

7.45 

.27 

-5.37 

500. 

7.51 

.27 

-5.82 

560. 

7,65 

.24 

-6,53 

600. 

7.75 

.0 

7.75 

610. 

7.76 

,0 

7.76 

617. 

7.77 

.0 

7.77 

635. 

7.8 

.0 

7.8 

1200. 

8.5 

.0 

8.5 

1560. 

6.87 

.0 

6.87 
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TABLE  A3.  THERMAL  PROPERTIES  USED  AS  A  FUNCTION  OF  TEMPERATURE 


Temperature 

Thermal  Conductivity 

(°F> 

(BTU/°F  sec  in.) 

0 

,0005 

400 

.000493 

600 

.000472 

800 

. 000449 

1000 

.000412 

1200 

. 00037 

1400 

.00031 

1560 

.00031 

Temperature 

Specific  Heat 

<*F) 

(BTU/fTF) 

0 

,105 

1300 

.184 

1400 

.38 

1550 

.14 

Density  .284  #/in,3 

Martensite  Start  Temperature  325°C  (617°F) 
Martensite  Finish  Temperature  260°C  (500°F) 
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BREECH  SECTION  MUZZLE  SECTION 
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OD:  8.90  INCH 
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TTT  DIAGRAM  -  TRANSIENT  COOLING  FOR  BREECH  END 
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THERMAL  AND  TRANSFORMATION  STRESS 
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FIGURE  8 
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TTT  DIAGRAM  -  TRANSIENT  COOLING  FOR  MUZZLE  END 


T 

E 

M 

P 

E 

R 

A 

T 

U 

R 

E 

I 

N 

D 

E 

G 

R 

E 

E 

S 


FIGURE  9 


in»—  orujcnco  cl  cn 


THOUSANDS  VARIATION  OF  STRESS  WITH  TIME 
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FINITE  ELEMENT  ANALYSIS  OF  FABRICS  WITH  NONLINEAR  STRESS-STRAIN  LAWS 


A.  R.  Johnson 

US  Army  Materials  and  Mechanics  Research  Center  (DRXMR-3MM) 

Watertown,  MA  02172 


ABSTRACT .  The  material  constitutive  relationships  for  woven  fabrics  are 
nonlinear.  This,  together  with  the  fact  that  transversely  loaded  fabric 
membranes  undergo  large  deformations,  presents  a  difficult  analysis  problem 
to  designers  using  fabrics  in  structures.  In  this  effort,  the  constant 
strain  triangular  element  is  used  for  the  development  of  finite  element 
gradient  and  tangent  matrices  for  the  case  when  the  material  energy  density 
functional  is  a  function  of  the  warp  and  fill  strains.  The  element  matrices 
are  derived  using  the  nonlinear  Green  strain-displacement  relations  to  describe 
the  warp  and  fill  strains  in  terms  of  the  deformations  of  the  fabric.  Biaxial 
stress-strain  data  for  a  2.6  oz/yd^  cotton  cloth  is  used  to  obtain  approximate 
stress-strain  functionals  and  an  energy  density  functional.  The  element 
is  used  to  determine  the  deformations  and  stresses  in  a  uniformly  loaded  square 
fabric  membrane. 

INTRODUCTION.  Fabrics  are  used  as  structural  components  in  lighter 
than  air  vehicles,  parachutes,  and  tents.  Recent  works1"5  have  concentrated 
on  determining  material  properties,  constitutive  relations  and  constructing 
appropiate  finite  element  algorithms  to  deal  with  the  large  deformations 
and  nonlinear  materials.  Previous  constitutive  relationships  used  stresses 
as  independent  variables  along  with  warp  (lengthwise) -to-fill  (woof)  stress 
ratio,1"'*  This  presents  a  complication  to  displacement  finite  element 
formulations. 

In  this  effort  the  idea  of  fitting  a  function  to  the  stress-strain 
data  with  the  strains  as  independent  variables  is  pursued  with  a  finite 
element  formulation  similar  to  that  used  by  Johnson6,7  to  determine  the 
deformations  of  rubber  membranes.  The  warp  and  fill  stresses  are  expressed 
via  warp  and  fill  strains.  An  energy  density  functional  for  fabric  membranes 
is  then  obtained  in  terms  of  the  fabric  warp  and  fill  strains.  Green's 
strain-displacement  relations  are  used  and  a  potential  energy  functional 
is  obtained  in  terms  of  material  displacements.  A  total  Lagrangian  formulation 
is  then  constructed  and  used  to  obtain  displacements,  strains  and  stresses 
in  a  uniformly  loaded  initially  flat  square  fabric  membrane. 

CONSTITUTIVE  RELATION.  The  stress-strain  data  for  biaxially  loaded 
woven  fabrics  indicate  that  the  response  of  a  fabric,  say  in  the  warp  direction, 
is  dependent  on  the  ratio  of  the  warp-to-f ill  stresses  (see  Refs  2,  3) .  In 
Figure  1  some  of  the  biaxial  stress-strain  data  from  reference  2  is  presented 
for  a  2.6  oz/yd^  cotton,  cloth.  This  data  was  used  to  generate  the  curves 
shown  in  Figure  2.  When  similar  curves  are  drawn  for  linear  materials, 
a  series  of  equally  spaced  constant  stress  contours  is  obtained  whose  slopes 
in  the  £MC,  plane  are  dependent  on  Poisson's  ratio.  Comparisons  indicate 
that  the  fabric  constant  load  profiles  are  associated  with  a  variable  Poisson's 
ratio,  and  that  the  magnitude  of  the  stresses  yrows  ever  more  rapidly  for 
fabrics  as  the  strains  increase.  That  is,  for  fabrics,  the  spacing  between 
the  constant  load  profiles  for  a  given  load  change  tends  to  decrease  as 


the  load  increases,  while  it  remains  constant  for  linear  materials.  Then, 
a  constitutive  relationship  for  fabrics  should  have  a  variable  Poisson's 
ratio  and  an  increasing  load-to-strain  slope  as  the  strain  increases.  After 
investigating  several  possible  functions,  an  exponential  form  was  selected, 
i.e. , 


U  ;  .1 

‘  (1) 


Upon  expanding  the  exponentials  in  (1)  we  obtain 


f— 


The  first  order  terms  in  (2),  with  T,  }yTL  constant,  represent  a  linear  material. 
With  suitable  constants  C„  >  and  functions  we  may  expect 

to  obtain  a  behavior  close  to  that  of  a  fabric. 

A  material  which  strictly  follows  the  constitutive  relationship  given 
by  (1)  can  sustain  compression  unlike  a  fabric  which  cannot.  The  maximum  com¬ 
pression  stress  which  can  be  obtained  from  (1)  is  -1  when  the  strain  is  (or, 
strains  are)  infinetly  negative.  No  negative  stresses  were  obtained  for 
the  problem  solved  in  this  effort. 

Using  the  data  from  Figure  2  and  experimenting  with  several  linear 
functions  for  and  ,  the  following  constitutive  relation  is  obtained 

N;  ej25(€1*0.5€2-2S£l€l) 

1  1  (3) 

Ni=el2S(V  ast|-25€I£I)_1 
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Constant  stress  profiles  for  (3)  are  shown  in  Figure  3.  These  contours 
agree  well  with  those  in  Figure  2  in  the  first  quadrant  and  have  the  shifting 
and  curvature  properties  similar  to  the  profiles  shown  in  Figure  2.  The 
potential  energy  can  now  be  easily  obtained  in  terms  of  the  strains. 

METHOD  QF  ANALYSIS.  For  simplicity  we  assume  that  the  potential  energy 
is  a  function  of  two  field  variables  X,  and  •  Thesefield  variables 

are  the  strains  and  in  the  detailed  analysis  presented  in  the  next 
section.  Extension  to  the  case  when  more  nodal  variables  are  used  is  straight 
forward.  X|  and  are  approximated  by  the  nodal  variables  X  and  Y  over 

an  element  domain.  That  is,  we  have 


Tr  =  F  (*„**)  = 


the  potential  energy 


(4) 


where  X  ,=  X,(X,Y) 

Then,  the  gradient  of  the  potential  energy  is 


S' 
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lx,Ml  +dXFl^2 


(6) 


The  tangent  matrix  is  written  as 


(7) 


which  in  this  case  becomes 


*f 


Sf 


}F  T  ^F  - 


(3) 


A  solution  is  defined  as  a  point  CX,V)  at  which  IT  is  a  minimum.  The 
Newton-Raphson  method  can  be  used  to  locate  stationary  points  of  TT  using 
the  element  gradient  and  tangent  matrices  as  follows 


*  M 


where 


represents  the  i'th  vector,  etc. 


(9) 


854 


GRADIENT  AND  TANGENT  MATRICES  FOR  CONSTANT  STRAIN  ELEMENT.  The  coordinate 
system  used  for  the  element  is  shown  in  Figure  4.  In  the  element  coordinate 
system,  the  strain-displacement  relations  can  be  written  as 


where  (u^/v.ur;  are  the  displacement  components  of  a  material  point  along  the 
element  coordinate  directions  #  resPect^-ve^y  * 

We  now  introduce  the  relationships  between  the  element  Cartesian  coordinates 
and  the  area  coordinates  (see  ref.  8  and  Figure  4) . 


?,*(«>.♦  Me  ^c,y,)/(2A) 

f.  ■  ( a,  ♦  +  WCM) 

W  V  4K,*C,%)/f2A) 


A 


X  de-t 


area  of  element 


a  -  xtf 

b,  *  Y-  -  YeK 

c,  *  x:  -  xi 


the  Cartesian  coordinates  of  node 
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and  C  i  ,j|K)  is  an  even  permutation  of  (1,  2,  3). 

Next,  using  the  traingular  constant-strain  element  interpolation  functions, 
we  interpolate  U*,  /\Te  and  U/G  as  follows 


r- 

c=t 

ii 

u"-(u.>;,u$)T  , 

Are=  ^nrn  5 

at"  •  l  w;*,  < ,  rr/7 

ure*  <pT\*rn  , 

u"  •(«*  ,wf 

Si) 


(11) 


where  (u*  4  AT;^  uT*  )  are  the  nodal  displacements  of  the  i 1 th  node.  With  the 
definitions  given  above,  the  strain  in  the  warp  direction  can  be  written  as 


But,  i  Si)  •  jft  ( b, ,  bi;bj) 


so  the  dot  products  become 


=  TftIb,.u.; 

*  °<b 

5  zaI>>,(»;  > 

<!>* 

1 TT?  ^  * 

and 

e. 

Similarly  ^ i 

“  fc.  * 

(12) 


(13) 


The  gradient  of  the  element's  potential  energy  is  computed  as  follows 


where 


TTe  =  5 

Ac 

ss  the  areal  energy  density  functional  given  by 

bt£.N,Cc„0*Hn»u-« 

W  =  the  work  done  on  the  element  by  external  forces, 


=  the  domain  of  the  element. 
Then,  the  gradient  becomes 


f  [ii ' 


w"’  tfAfi  ^aj;- 

= I  8.  &  dA.-  » 
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or,  in  a  shorter  notation 


and  the  element  tangent  matrix  becomes 

•s*  •  *4«.  •  & 
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where 


(IS) 


25S 


and  the  terms  in  J€,  and  Je2  are  found  from 
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(20) 


Using  the  constitutive  relation  given  in  (1)  we  can  compute  the  rate  of  change 
of  the  energy  density  function  in  (15)  and  obtain  the  coeffecients 


Now  with  a  description  of  a  fabric's  geometry,  boundary  conditions  and 
loading,  solutions  can  be  obtained  using  the  algorithm  described  above. 

Uniformly  Loaded  Square  Membrane.  One  quarter  of  a  uniformly  loaded  square 
fabric  membrane  is  shown  with  boundary  conditions  in  Figure  5.  To  obtain  a  solution 
to  this  problem  the  following  approximate  expression  for  the  work  done  was  used. 


Uj  ^^dVolume  oadxd'f  =  oCurdxd'f 

si  0 

where  q  =  the  pressure  applied  to  the  membrane. 

w  =  the  vertical  displacement  of  the  membrane, 
and  f\-  =  the  (x,y)  domain  of  the  fabric. 


(21) 
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For  the  constant  strain  element  described  above  we  find  (for  an  element  of  A). 
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(22) 


and 


where 

external 


U«  *  (  u'  •  Ar*<  ^  <  .  < .  w* ,  u; ,  AT*,  ur‘ )  T 
work  done  on  the  elements. 


and  we  in  the  portion  of  the 


The  deflections,  strains  and  stresses  for  a  uniformly  loaded  square  membrane 
with  the  boundary  conditions  given  in  Figure  5  and  a  constitutive  relation  given 
by  (3)  were  determined.  The  size  of  the  membrane  was  10  in  X  10  in,  (represents 
one  quarter  of  a  20  in  X  20  in  uniformly  loaded  membrane) .  Figure  6  shows  the 
maximum  displacement  and  load  shown  indicates  that  when  the  membrane  is  nearly 
flat  it  is  very  flexible  but  after  it  has  been  deformed  it  becomes  stiff.  The 
values  of  the  membrane  stress,  N  ,  are  shown  in  Figure  7  for  a  pressure  of  0.5 
lb/in  .  The  distribution  for  N  Xwas  entirely  symmetrical  with  respect  to  Nx  and 
is  not  shown.  This  data  indicates  that  the  stresses  are  very  low  in  the  comers 
of  the  uniformly  loaded  square  membrane  and  arc  the  largest  along  the  centerlines. 
Corresponding  values  of  the  strains  E  and  E  are-  shown  in  Fig*  8.  The  strains 
indicate  that  the  fabric  will  tend  to^ull  tight  across  the  two  centerlines  of 
the  square  but  remain  relatively  unloaded  near  the  comers. 


The  effect  of  mesh  size  on  the  accuracy  of  the  solution  was  determined  by 
using  Richardson's  extrapolation  method  to  study  the  accuracy  of  the  displacements. 
This  was  accomplished  by  assuming  the  center  deflection  of  the  square  membrane 
was  related  to  the  mesh  size  as  follows. 


UX  (M  =  w  C°)  + 


where  w(h)  is  the  center  deflection  foj  mesh  size  h  and  C,P  are  constants.  Solutions 
were  obtained  at  a  pressure  of  q  =  10~  lb/ in  .  for  6X6,  8X8  and  10  X  10  meshes 
(i.e.  for  h  =  1/6,  1/8,  1/10).  The  values  for  h  and  w(h)  were  used  to  determine 
the  convergence  rate  (P  -  1.69)  shown  in  Figure  9. 

Conclusions .  The  elastic  nonlinear  behavior  of  fabrics  can  be  modeled  by 
exponential  functions  in  which  stresses  are  determined  as  a  function  of  the 
strains.  These  stress-strain  relationships  allow  the  stresses  in  the  deformed  fabrics 
to  be  determined  by  a  nonlinear  displacement  finite  element  method.  This  approach 
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is  useful  for  determining  the  deformations,  strains  and  stresses  in  uniformly 
loaded  square  fabric  membranes.  The  approach  here  can  be  modified  and  used  for 
designing  tents  and  inflatable  fabric  structures. 
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FILL  STRAIN 


FIGURE  1.  Biaxial  fabric  test  data  for  2.6  oz/yd2  cotton 
typewriter  ribbon  cloth,  (taken  from  Ref.  2) 


FIGURE  2 


.  Contours  of  constant  stress  as  a  function  of 
€,  and  Cz  for  2.6  OZ/yd2 
ribbon  cloth. 


cotton  typerwriter 


-0.02- 

-0.03  - 


Figure  3.  Contours  of  constant  stress  as  function  of 
€,  and  for  exponential  constitutive  law. 
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pressure  on  membrane. 


^  *  H-  mesh  ^houm 


FIGURE  5.  Loading,  boundary  conditions,  and  typical 
mesh  for  flat  square  fabric  membrane. 
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FIGURE  6.  Maximum  deflection  vs  pressure  for  an  initially  flat 
square  fabric  membrane. 


FIGURE  8.  Membrane  strains  for  a  6  X  6  mesh. 
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EXPLICIT  FORMULAS  FOR  C11  PIECEWISE  HERMITE  BASIS  FUNCTIONS 


Royce  W,  Soanes,  Jr, 

US  Army  Armament,  Munitions,  and  Chemical  Command 
Armament  Research  and  Development  Center 
Large  Caliber  Weapon  Systems  Laboratory 
Benet  Weapons  Laboratory 
Watervliet,  NY  12189 


ABSTRACT,  Completely  factored  forms  of  the  piecewise  Hermite  basis 
functions  will  be  derived.  All  necessary  coefficients  for  any  level  of 
smoothness  will  be  shown  to  reside  conveniently  in  Pascal* s  triangle. 

I ,  INTRODU CTION .  We  begin  with  a  theoretical  characterization  of  the  Cn 
basis  functions  with  which  we  are  dealing.  For  a  given  node  in  R,  there  is 
a  basis  function  %j(x)  associated  with  the  jth  derivative  of  any  function  f 
at  x^,  where  j  ranges  from  o  to  n.  In  addition,  each  basis  function  is 
nonzero  on  only  two  adjacent  subintervals.  Continuing  with  the  definition  of 
the  H*s,  we  wish  an  approximation  F  to  f  of  the  form: 

F(x)  -  l  Hij(x)f  Ixi)  +  Ni+xj  (x)f  4x1+1) 

j-o 


where  xi  <■  x  <  xi+x. 

In  order  that  F  and  its  n  derivatives  will  agree  with  f  and  n  of  its 
derivatives  at  nodes  xi  and  xi+i ,  it  is  sufficient  that  the  basis  functions 
associated  with  arbitrary  node  i  obey  the  following  conditions: 

(k) 

Hij(xi)  -  6jk 


(k)  (k) 

Hij<xi-l)  *  Hi-jCxi+i)  »  0 


where  0  <  j ,k  <  n. 

On  a  given  subinterval,  therefore,  each  H.  must  obey  n+1  conditions  on  the 
left  extreme  and  trH  conditions  on  the  right  extreme.  The  H*s  may  therefore 
be  represented  by  two  distinct  polynomials  of  degree  2n+l* 

The  following  series  of  pictures  depicts  the  C4  basis  functions  (scaled) 
and  their  derivatives.  The  five  functions  across  the  top  are  the  basis 
functions  associated  with  the  Oth  through  the  4th  derivatives  of  f  and  the 
functions  underneath  them  are  their  successive  derivatives.  Note  that  the 
functions  along  the  diagonal  are  nonzero  in  the  center  while  all  off  diagonal 
functions  are  zero  there. 
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Figure  1.  C4  Basis  Functions  and  Their  Derivatives. 


Although  subsequent  analysis  will  enable  us  to  compute  all  the  basis 
functions  for  any  given  level  of  smoothness  (n)  in  an  extremely  simple  way, 
the  author  has  not  seen  anything  similar  mentioned  or  referenced  in  any  finite 
element  text  thus  far. 

II.  DERIVATION.  We  begin  by  defining  a  finite  support  Taylor  series 
(FSTS).  Take  the  ordinary  Taylor  series  for  f  around  node  x^,  truncate  it 
beyond  nth  derivative  terms,  and  multiply  each  term  by  a  function  which  will 
have  the  effect  of  (1)  not  disturbing  the  truncated  Taylor  series  at  all  at 
node  and  (2)  zeroing  the  series  and  n  of  its  derivatives  at  nodes  xi-i  and 
xi+l*  If  we  do  this  for  each  node  xi,  calling  the  result  F^Cx),  we  get  a 
global  approximation  to  f  which  agrees  with  the  truncated  Taylor  series  of  f 
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at  each  node  by  simply  summing  the  F^,  This  is  just  an  alternative  way  of 
defining  the  piecewise  Hermite  approximation  which  we  will  find  quite  useful 


where 


TS:  f(x)  =  l  f  ixi)(x-xi)j/j! 
j=Q 

n  (j)  j 

FSTS:  Fi(x)  =  £  f  txiXx-Xj.)  g-j  (Ri(x)  )/j  ! 

j-0 


(1) 

(2) 


Ri(x)  =1  if  x  <  xi_i  or  x  >  X£+i 

=  (xi’xJ/Cxi-Xi-i)  if  xi-i  ^  x  <  Xi 

=  (x-xiJ/Cxn-i'Xi)  if  xi  <  x  c  xi+i  (3) 

R^Cx)  is  just  one  minus  the  hat  function  associated  with  node  i  or  just  the 
relative  position  of  x  in  either  the  left  or  the  right  hand  subinterval.  The 
domain  of  the  g  functions  is  therefore  just  the  interval  [0,1]. 

The  objective  now  is  to  determine  the  gfs.  Since  we  want  F^  and  its 
derivatives  to  behave  in  a  certain  manner,  we  must  first  differentiate  F^(x) 
an  arbitrary  number  of  times.  Using  Leibniz's  rule  for  differentiating  a 
product,  we  have: 

(m)  “  (1)  min{j,ra>  m  j  -k  (m-k)  '  m-k 

Fi(x)  =  l  f  txi)  l  (^)(x-xi)  gj  (Ri(x))(Ri(x))  /(j-k)!  (4) 

j-0  k=0 

and  substituting  x  =  x^  in  Eq.  (4)  we  have 


(m)  ®  (i)  m  (m-j)  ’  m-j 

F±  (xi)  -  l  f  txi)(j)gj  (0) (Ri(0) ) 
j»0 


(5) 


we  may  define  Ri(0)  to  be  Ri(±e)  or  take  limits  from  either  side  of  x±. 
We  now  want  conditions  on  the  g's  which  are  sufficient  for: 


(m)  (m) 

FiCxi)  =»  f  (xi) 


and 


(m)  (m) 

Fi(xi-i)  -  FiCxi+i)  -  0 


for  0  <  m  <  n. 


(6) 


(7) 
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We  may  glean  these  conditions  from  Eqs,  (4)  and  (5).  Conditions  on  the 
g*s  sufficient  for  Eqs,  (6)  and  (7)  may  be  seen  to  be: 

gj<0)  -  1  0  <  j  <  n  (8) 

(m) 

gj(0)  =  0  0<j<n,  l<ra<  n-j  (9) 

(m) 

gj(l)  *0  0  <  j  <  n,  0  <  m  <  n  (10) 

The  2n-j+2  conditions  on  gj  may  therefore  be  met  by  a  polynomial  of 
degree  2n-j+l ;  the  product  of  gj  and  (x-x^)j  in  Eq.  (2)  is  therefore  of  degree 
2n+l  for  all  j,  as  expected* 

The  g  of  lowest  degree  is  therefore  gn,  which  has  defining  conditions: 

8n(0)  -  1 

and 

(m) 

gn(l)  =0  0  <  m  <  n 

This  g  may  be  obtained  by  inspection  and  is: 

n+1 

gn(x)  =  (1-x)  .(11) 

Now,  from  Eq.  (10),  we  may  observe  that  all  the  g’s  have  the  same  derivative 
behavior  at  x=l.  We  therefore  need  only  define  gj (x)  as  the  product  of  some 
unknown  polynomial  hj (x)  and  gn(x) : 

n+1 

gj(x)  -  hj  (x)  ( 1-x)  0  <  j  £  n  (12) 

where  hj  is  a  polynomial  of  degree  n-j : 

nrJ  k 

hj(x)  -  l  a^x  0  *  j  <  n  (13) 

k-0 

It  may  seem  at  first  glance  that  the  a*s  should  have  an  extra  subscript  - 
namely  j,  since  we  are  seeking  n+1  sets  of  coefficients.  As  will  become 
immediately  apparent,  however,  one  set  is  sufficient  and  all  other  sets  are 
subsets  of  this  one.  This  subset  property  and  the  fact  that  the  largest  set 
of  a's  may  be  obtained  in  an  almost  trivial  manner,  is  what  makes  the  result 
of  this  analysis  truly  simple  indeed. 

If  we  now  obtain  the  rath  derivative  of  gj(x)  and  evaluate  it  at  x  =  0,  we 

get: 

(m)  ®  m-k 

gj(0)  =  m!  I  ak(-l)  (n+1)  (14) 

k=0 
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Using  Eqs.  (8)  and  (9),  we  have: 


a0  =  X 


and 

to  rti— Tut 

l  ak(-l)  <n+l)  =0  for  1  <  to  <  n-j  (15) 

k=0  m-k 

This  is  a  lower  triangular  system,  which  may  be  easily  solved  for  the  a’s  by 
forward  substitution*  Note  that  the  coefficients  do  not  depend  on  j ,  so  we 
might  as  well  solve  the  largest  system  (j-0)  and  obtain  all  the  a’s,  although 
only  the  first  n-j+1  a's  are  needed  for  hj(x)* 

Solving  system  (15)  for  j-Q  and  a  few  values  of  n  gives  us  the  following 
table  of  a’s 


TABLE  1.  COEFFICIENTS  OF  ho 


n 

*1 

a2 

a3 

*4 

a5 

0 

1 

1 

1 

2 

2 

1 

3 

6 

3 

1 

4 

10 

20 

4 

1 

5 

15 

35 

70 

5 

1 

6 

21 

56 

126 

252 

Inspection  of  this  table  gives  us  a  very  simple  recursion  for  the  n  set  of 
coefficients  in  terms  of  the  n-1  set: 


n 

aQ  =*  1 


n  n  n-1 

ak  “  ak-l  +  ak  1  <  k  <  n-1 


n  n 

an  “  2an-l  (15) 

where  the  superscripts  denote  the  level  of  smoothness. 

We  therefore  have  here  nothing  more  than  one  half  of  Pascal’s  triangle, 
viewed  at  an  angle! 

Recalling  that : 


n+1 

gj (x)  -  hj (x)(l-x) 
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and  from  the  FSTS  that: 


Hij(x)  -  gj (Ri(x))(x-Xi)J /j ! 

we  have,  explicitly: 

n+1  j 

HyCx)  =  hj  (R*(x))(l-Ri(x))  (x-xi)  /j  !  (17) 

therefore,  f  may  be  approximated  on  [x^jXi+il  by: 

n  (4 )  x-xi  xi+l-x  n+1  j 

I  (f  (xi)hj  ( - )( - )  (x-Xj,) 

j*0  Xi-H-Xi  *i+l-xi 

(i)  xi+i-x  x-xi  n+1  j 

+  f  txi+i)>u  ( - )( - )  (x-Xi+i)  }/j  !  (18) 

Xi+l“xi  xi+i-xi 


In  order  to  evaluate  the  derivatives  of  the  H's,  one  may  expand  the 
polynomials  involved  and  multiply  out  or  one  may  apply  Leibniz's  rule  a  couple 
of  times.  The  latter  course  is  deemed  simpler  and  more  numerically  stable 
since  it  leaves  a  result  which  is  In  "nearly"  fully  factored  form*  The  latter 
method  was  used  to  produce  Figure  1, 
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1  ABSTRACT.  A  bivariate  C1  quadratic  B-spline  basis  for  the  space  of 
CA  piecewise  polynomials  with  total  degree  two  on  a  crisscross 
triangulation  is  given.  This  basis  has  very  important  algebraic, 
geometric  and  approximatic  properties,  and  can  be  used  in  a  variety  of 
applications.  In  particular,  it  can  be  used  adaptively  in  pattern 
recognition,  image  processing  and  data  reduction.  In  image  restoration, 
for  example,  it  gives  much  better  pictures  than  the  tensor  product 
splines  using  the  same  discrete  data. 

1.  INTRODUCTION.  Let  D  be  a  domain  in  R2  and  A 
straight  line  segments  that  partition  D  into  cells.  The 
all  functions  s  in  C^ (D)  such  that  the  restrictions  of 
cell  are  polynomials  with  total  degree  at  most  k,  that  is 

I  a  xV, 

°<i+j<k  J 

where  u  and  k  are  nonnegative  integers,  is  called  the  space  of 
bivariate  splines  with  (total)  degree  k  in  (D)  on  the  grid 
partition  A,  and  will  be  denoted  by  S^(D,A).  Clearly*  S^(D,A) 
becomes  the  trivial  space  of  polynomials  with  total  degree  k  if 
n  >  k,  More  generally,  we  could  allow  A  to  consist  of  algebraic 
curves.  Some  fundamental  tools  have  recently  been  developed  in  [3]  to 
study  these  spaces.  These  methods  could  be  used  to  determine 
dimensions,  find  locally  supported  splines,  etc.  as  in  [5]  and  [4]* 
Locally  supported  splines  with  minimum  supports  are  most  important  both 
in  theory  and  applications*  These  functions  are  generalizations  of 
univariate  B-spllnes,  and  are  sometimes  called  bivariate  B-splines, 
although  this  name  is  used  by  some  authors  to  include  all  locally 
supported  splines  which  are  strictly  positive  inside  the  supports*  When 
A  is  a  regular  grid  partition,  these  functions  can  usually  be  obtained 
by  projections  as  discussed  in  [1].  For  a  fairly  up-to-date  review  of 
the  subject  of  multivariate  splines,  the  reader  is  referred  to  the 
95-page  survey  article  [8]. 

2.  B-SFLINES  ON  CRISSCROSS  TRIANGULATIONS.  In  application,  such 
as  in  surface  fitting,  pattern  recognition,  image  processing,  data 
reduction,  etc. ,  D  Is  usually  a  rectangular  region,  and  depending  on 
the  density  and  variation  of  the  data  to  be  studied,  D  is  divided  into 
rectangular  subregions  of  different  sizes.  Let 


a  grid  of 
collection  of 

s  to  each 
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a  -  Xg  <  <  . * .  <  -  b,  and 

c  =  y0  <yj  <  ...  <  yn+1  =  d. 

Two  common  ways  to  triangulate  the  rectangular  subregions 
[x±,  xi+1]  *  [y^,  yj+1J »  i  =  0, # • ,  ,m  and  j  =  0,...,n,  are  to  draw  in 
diagonals  with  only  positive  (or  only  negative)  slopes,  or  to  draw  in 
diagonals  with  both  positive  and  negative  slopes  of  these  subregions. 
They  are  called  uni-diagonal  (or  type-1)  and  crisscross  (or  type-2) 
triangulations.  If,  in  particular,  the  partition  is  regular,  i,e. 

xi+l  -  xi  =  xi  ■  xi-l»  1  “!.•••.“>  and  yj+i  '  -  yj-i» 

j  ■  l,...,n,  then  a  unidiagonal  triangulation  becomes  a  three-direction 
mesh  and  a  crisscross  triangulation  becomes  a  four-direction  mesh.  Very 
interesting  results  on  a  three-direction  mesh  can  be  found  in  [2]  and 
[6],  It  has  been  pointed  out  in  [7],  however,  that  a  minimum  support 

for  splines  in  S^(D,A)  where  A  is  a  three-direction  mesh,  no  longer 
supports  a  nontrivial  B-spllne  if  A  becomes  irregular.  Since 
irregular  (i,e.  non-uniform)  subdivisions  are  very  important  in 
applications,  especially  In  adaptive  procedures,  one  would  wish  to 
obtain  bivariate  B-splines  that  change  continuously  with  the  grid 
lines.  On  crisscross  triangulations,  a  cubic  Cr  B-spline  is  given  in 
[4]  and  a  quadratic  C1  B-spline,  with  slightly  larger  support,  is  given 
in  [7],  These  B-splines  have  minimum  supports  and  are  continuous  in  the 
grid  lines  x  =  and  y  -  y^.  Since  lower  degree  splines  are  more 
desirable,  we  only  discuss  the  quadratic  one. 

To  give  an  expression  of  the  B-spline,  we  give  each  of  the 
polynomial  pieces,  A  bivariate  quadratic  polynomial  has  six 
coefficients  and  they  are  uniquely  determined  by  the  six  values  of  the 
polynomial  at  the  three  vertices  and  the  mid-points  of  the  three  sides 
of  a  triangle.  Let  A,  B  and  C  be  the  vertices  of  a  triangle  and 
Af,  Bf  and  C*  the  mid-points  of  the  sides  opposite  to  A,  B  and  C 
respectively.  Let  us  use  barycentric  coordinates;  that  is,  let  a,  b 
and  c  be  linear  polynomials  such  that  a(A)  -  1,  b(B)  -  1,  c(C)  *  1, 
and  the  values  of  a,  b,  c  at  the  other  vertices  are  zero.  Then  the 
quadratic  polynomial  which  takes  on  the  values  f(A),  f(B),  f(C),  f(Af), 
f (B* ) ,  f(C?)  at  the  six  points  A,  B,  C,  A',  B’,  C’,  respectively,  is 
given  by 

p(a,b,c)  *  a(2a-l)f (A)  +  b(2b-l)f(B)  +  c(2c-l)f(C) 

+  4[bcf (A* )  +  caf(B»)  +  abf(C’)]  . 

Note  that  a  +  b  +  c  =  1,  so  that  p  is  actually  a  polynomial  in  two 

variables.  In  writing  a  computer  program  to  give  p  as  a  function  of 

x  and  y,  one  could  simply  consider  a  =  0,  b  -  0,  c  =  0  as  the 

(linear)  equations  of  the  sides  opposite  to  A,  B,  C  respectively,  so 
normalized  that  a(A)  =  b(B)  =  c(C)  =  1. 
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We  now  give  a  representation  of  our  C1  bivariate  quadratic  spline 


whose  support  is  an  octagon  contained  in  the  rectangle 


[x‘ 


i-1* 


Xi+2]  X 


[yj-l 


yj+2^ 


as  in  Fig.  1.  Here,  the  values  of 


Of  course,  since 


at 


six  (appropriate)  points  of  each  triangle  are  given. 

vanishes  on  the  boundary  of  its  octagonal  support,  we  do  not  have 

to  give  its  values  there.  The  values  A.,  A.'  , .  ,  B.,  and  B!  ,  in 

1  Iri  J  J+l 

Fig.  1  are  defined  by 


Fig.  1 


Note  that  these  values  have  very  interesting  geometric  meaning.  In 
addition,  they  are  all  positive  numbers  and 

A.  +  A !  =  B  .  +  B -  1 
i  i  J  J 

for  all  i  and  j#  In  Fig.  2,  we  give  a  three-dimensional  picture  of 
Bij  with  xi-l . xi+2  =  °»  4’  8  and  yj-l . yj+2  =  0,3, 5, 6. 
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Fig-  2 

View:  (15,10,1) 
Target:  (.5,5, .5) 
Field:  150° 


3,  PROPERTIES  OF  B 

Strictly  positive  inside  its  s^PP°r^  ^^'bell-shaped  curve  as 

„„  ,«*  (v.ac.1)  =~"7  “  0  „’«p ,  ,!««!»  1.  ....  -  >, 

expected.  A  very  important  feature  tor  PP 
form  a  partition  of  unity;  tha  » 


^  .  It  is  clear  from  Fig 


1  that  is 


From  Fig*  2,  it  can  also  be  seen 


n+1  nrfl 


H-ri  urr-L 

l  I  Bij<x*y)  =  1 
j— 1  i=-l 

,  To  Kl  X  Tr  dl.  Here,  we  have  assigned 
for  all  (x,y)  in  the  rectangle  _[a,b]  Jc.i]  <  <  and 

arbitrary  values  x_2  <  *_i  <  xo  3  <  ~  nrf2  “  ®t3  ,  and 

y9<yliy0  =  C<d  =  7n+l  -  yn+2  -  yn+3‘  C°aXeSCenCe  1 

y"-s  are  allowed  as  in  the  univariate  setting.  Of  course. 

xj  .  x  then  the  joining  condition  along  this  edge  becomes  C  and 

if  X;+=l*i+1  =  x1+2  then  By  has  a  jump  on  this  edge.  See  7ig.  a.  • 


S80 


in&ili 


Fig.  3a 

xi_l=0»  xi=1-  xi+l“xi+2=4 

yi=3’  yj+l=5>  y3+2=6 


a 


iM 

ttett'm 


,  r  VJv  a 

mV 


Fig.  3b 


xi-r0’  xi=xi+rxi+2=1 

y3-l_0>  yj=3’  yj+l=5>  yj+2 


The  provision  of  coalescence  of  the  grid  lines  makes  B  more  flexible 
for  application.  It  is  interesting  to  point  out  that,  different  from 
the  univariate  setting,  the  B^'s  are  linearly  dependent.  In  fact,  we 
have  the  relationship 

n+1  mfl 

jLi  iLi <x±+i '  xi)(yj+i  ■  7j)B«(*'y)  =  0 

for  all  (x,y)  in  [a,b]  *  [c,d].  This  dependence,  however,  does  not 
diminish  the  utility  of  B^  in  various  approximation  schemes.  We 
finally  remark  in  this  section  that  the  center  crisscross  in  the  support 
of  B^  is  in  general  active,  and  becomes  inactive  when  the  partition 
becomes  regular.  For  regular  grid  partition,  B^  becomes  a  box-spline 
as  discussed  in  [1]  and  this  box-spline  was  first  constructed  in  [9]. 

4.  APPLICATION  TO  IMAGE  RECONSTRUCTION.  During  the  presentation  at 
this  "First  Army  Conference  on  Applied  Mathematics  and  computing",  we 

have  shown  C1  images  constructed  from  64  x  64  and  128  x  128 
discrete  data  of  pictures  of  a  girl,  a  portion  of  the  surface  of  the 
moon,  and  a  couple  by  simply  using  the  surface 


n*fl  nH-1  , 

z  =  I  I  x  (fCxi.y.)  +  f(Xi,y  ,)  +  f(xi+1>y.,) 

J— 1  1=-1  J  J  J 

+  f(x1+1,yJ+1))B.J(x,y) 


where  x_2  =  x_j  =  xQ>  x^  =  =  x^,  y_2  =  y^  =  yQ, 

yn+i  =  yn+2  =  yn+3,  and  the  other  x^s  and  y^'s  being  equally 

spaced.  This  surface  is  compared  to  interpolating  surfaces  using 
bilinear,  bicubic,  and  bicubic  Hermite  splines,  and  improves  each  of 
these  tensor-product  surfaces  by  at  least  0.2  db,  although  it  requires 
less  computer  time  than  the  bicubic  interpolating  tensor-product 
surface.  We  expect  that  if  variable  grid  lines  x  -  xi  and  y  =  y^ 
are  used  the  images  would  greatly  improve.  More  research  and 
experiments  are  required  in  this  direction. 
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ABSTRACT.  Origins  of  spectral  methods,  especially  their  relation  to 
the  Method  of  Weighted  Residuals,  are  surveyed.  Basic  Fourier, 
Chebyshev,  and  Legendre  spectral  concepts  are  reviewed,  and  demonstrated 
through  application  to  simple  model  problems.  Both  collocation  and  tau 
methods  are  considered.  These  techniques  are  then  applied  to  a  number 
of  difficult,  nonlinear  problems  of  hyperbolic,  parabolic,  elliptic,  and 
mixed  type.  Fluid-dynamical  applications  are  emphasized. 

I.  INTRODUCTION.  Spectral  methods  may  be  viewed  as  an  extreme 
development  of  the  class  of  discretization  schemes  known  by  the  generic 
name  of  the  method  of  weighted  residuals  (MWR)  [1].  The  key  elements  of 
the  MWR  are  the  trial  functions  (also  called  the  expansion  or  approxima¬ 
ting  functions)  and  the  test  functions  (also  known  as  weight  functions). 
The  trial  functions  are  used  as  the  basis  functions  for  a  truncated 
series  expansion  of  the  solution,  which,  when  substituted  into  the 
differential  equation,  produces  the  residual.  The  test  functions  are 
used  to  enforce  the  minimization  of  the  residual. 

The  choice  of  trial  functions  is  what  distinguishes  the  spectral 
methods  from  the  finite  element  and  finite  difference  methods.  The 
trial  functions  for  spectral  methods  are  infinitely  differentiable 
global  functions.  (Typically  they  are  tensor  products  of  the  eigenfunc¬ 
tions  of  singular  Sturm-Liouville  problems.)  In  the  case  of  finite 
element  methods,  the  domain  is  divided  into  small  elements,  and  a  trial 
function  is  specified  in  each  element.  The  trial  functions  are  thus 
local  in  character,  and  well-suited  for  handling  complex  geometries. 
The  finite  difference  trial  functions  are  likewise  local. 

The  choice  of  test  function  distinguishes  between  the  Galerkin, 
collocation,  and  tau  approaches.  In  the  Galerkin  approach,  the  test 
functions  are  the  same  as  the  trial  functions,  whereas  in  the  colloca¬ 
tion  approach  the  test  functions  are  translated  Dirac  delta  functions. 
In  other  words,  the  Galerkin  approach  is  equivalent  to  a  least  squares 
approximation,  whereas  the  collocation  approach  requires  the  differen¬ 
tial  equation  to  be  satisfied  exactly  at  the  collocation  points. 
Spectral  tau  methods  are  close  to  Galerkin  methods  but  they  differ  in 
the  treatment  of  boundary  conditions. 

The  collocation  approach  is  the  simplest  of  the  MWR,  and  appears  to 
have  been  first  used  by  Slater  [2]  in  his  study  of  electronic  energy 
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bands  in  metals.  A  few  years  later,  Barta  [3]  applied  this  method  to 
the  problem  of  the  torsion  of  a  square  prism.  Frazer,  et  al.  [4] 
developed  it  as  a  general  method  for  solving  ordinary  differential 
equations.  They  used  a  variety  of  trial  functions  and  an  arbitrary  dis¬ 
tribution  of  collocation  points.  The  work  of  Lanczos  [5]  established  for 
the  first  time  that  a  proper  choice  of  trial  functions  and  distribution 
of  collocation  points  is  crucial  to  the  accuracy  of  the  solution. 
Perhaps  he  should  be  credited  with  laying  down  the  foundation  of  the 
orthogonal  collocation  method.  This  method  was  revived  by  Clenshaw  [6], 
Clenshaw  and  Norton  [7],  and  Wright  [8].  These  studies  involved  appli¬ 
cation  of  Chebyshev  polynomial  expansions  to  initial  value  problems. 
Villadsen  and  Stewart  [9]  developed  this  method  for  boundary  value 
problems . 

The  earliest  investigations  of  the  spectral  collocation  method  to 
partial  differential  equations  were  those  of  Kreiss  and  Oliger  [10]  (who 
called  it  the  Fourier  method)  and  Orszag  [11]  (who  termed  it  pseudo- 
spectral).  This  approach  is  especially  attractive  because  of  the  ease 
with  which  it  can  be  applied  to  variable  coefficient  and  even  nonlinear 
problems.  The  essential  details  will  be  furnished  below. 

The  Galerkin  approach  is  perhaps  the  most  esthetically  pleasing  of 
the  MWR  since  the  trial  functions  and  the  test  functions  are  the  same. 
Indeed,  the  first  serious  application  of  spectral  methods  to  PDF's  — 
that  of  Silberman  [12]  for  meteorological  modelling  —  used  the  Galerkin 
approach.  However,  spectral  Galerkin  methods  only  became  practical  for 
high  resolution  calculations  of  nonlinear  problems  after  0rs2ag  [13]  and 
Eliasen,  et  al.  [14]  developed  a  transform  method  for  evaluating 
convolution  sums  arising  from  quadratic  nonlinearities.  Even  in  this 
case  spectral  collocation  methods  retain  a  factor  of  2  in  speed.  For 
more  complicated  nonlinear  terms  high  resolution  spectral  Galerkin 
methods  are  still  impractical. 

The  tau  approach  is  the  most  difficult  to  rationalize  within  the 
context  of  the  MWR.  Lanczos. [5]  developed  the  spectral  tau  method  as  a 
modification  of  the  Galerkin  method  for  problems  with  non-periodic 
boundary  conditions.  Although  it  too,  is  difficult  to  apply  to  non¬ 
linear  problems,  it  has  proven  quite  useful  for  constant  coefficient 
problems  or  subproblems,  e.g.,  for  semi-implicit  time-stepping 
algorithms . 

The  following  discussion  of  spectral  methods  for  PDE's  will  be 
organized  around  the  three  basic  types  of  systems  —  hyperbolic,  para¬ 
bolic,  and  elliptic  —  with  an  additional  section  for  a  difficult,  non¬ 
linear  problem  of  mixed  type.  Simple,  one-dimensional,  linear  examples 
will  be  provided  to  illustrate  the  basic  principles  and  details  of  the 
algorithms;  two-dimensional,  nonlinear  examples  drawn  from  fluid 
dynamical  applications  will  also  be  furnished  to  demonstrate  the  power 
of  the  method.  The  focus  will  be  on  collocation  methods,  although  some 
discussion  of  tau  methods  is  provided. 
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II.  HYPERBOLIC  EQUATIONS*  Linear  hyperbolic  equations  are  perhaps 
the  simplest  setting  for  describing  spectral  collocation  methods.  Both 
Fourier  and  Chebyshev  schemes  have  found  wide  application.  This  section 
will  first  present  the  fundamentals  of  both  approaches  and  then 
illustrate  them  on  a  nonlinear  fluid  dynamics  problem  involving  shock 
waves . 

Basic  Fourier  Spectral  Concepts.  The  potential  accuracy  of  spectral 
methods  derives  from  their  use  of  suitable  high-order  interpolation 
formulae  for  approximating  derivatives.  An  elementary  example  is 
provided  by  the  model  problem 


Ut  +  ux  =  0.  (1) 

with  periodic  boundary  conditions  on  [0,2ir]  and  the  initial  condition 


u(x,0)  ■  sin(7r  cos  x). 

(2) 

The  exact  solution 

u(x,t)  =  sin[ir  cos(x-t)] 

(3) 

has  the  Fourier  expansion 

00 

u(x,t)  =  l  u,  (t)  eikx. 

(4) 

k~ 

where  the  Fourier  coefficients 

Uj^t)  =  sin(^y-)  Jk(  it)  e“ikt 

(5) 

and  Jfc(t)  is  the  Bessel  function  of  order  k.  The  asymptotic  proper¬ 
ties  of  the  Bessel  functions  imply  that 


u^Ct)  *  0  as  k  >  ®  (6) 

for  all  positive  integers  p.  As  a  result,  the  truncated  Fourier  series 

N/?_1  -  ikx 

u  (x,t)  =  5,  11.(0  ellcx  (7) 

k=  -N/2+1 

converges  faster  than  any  finite  power  of  1/N.  This  property  is  often 
referred  to  as  exponential  convergence.  A  straightforward  integration- 
by-parts  argument  [15]  may  be  used  to  show  that  it  applies  to  any 
periodic  and  infinitely  differentiable  solution. 

The  standard  collocation  points  are 

j=0,X,*",N-l.  (8) 

Let  Uj  denote  the  approximation  to  u(x.),  where  the  time  dependence 
has  been  suppressed.  Then  the  spatial  discretization  of  Eq.  (1)  is 
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j 


(9) 


3u 

ax 


where  the  right-hand-side  is  determined  as  follows.  First,  compute  the 
discrete  Fourier  coefficients 


1 

N 


N— 1 


l 

j=0 


-ikx  j 
e  J 


Then  the  interpolating  function 


N  .  x 
2 


u(x) 


N/2-1 

I  uk  e^ 
k=  -N/2 


can  be  differentiated  analytically  to  obtain 


N/2-1 

l 


k=  -N/2-f-l 


-  ikx  A 
ik  uk  e  3 


(10) 


(U) 


(12) 


(The  term  involving  k  =  -N/2  makes  a  purely  imaginary  contribution  to 
the  sum  and  hence  has  been  dropped.)  Note  that  each  derivative  approxi¬ 
mation  uses  all  available  information  about  the  function  values.  The 
sums  in  Eqs.  (10)  and  (12)  can  be  obtained  in  0(N  Jin  N)  operations  via 
the  Fast  Fourier  Transform  (FFT) . 


An  illustration  of  the  superior  accuracy  available  from  the  spectral 
method  for  this  problem  is  provided  in  Table  I.  Shown  there  are  the 
maximum  errors  at  t  =  1  for  the  truncated  series  and  for  the  spectral 
collocation  method  as  well  as  for  second-order  and  fourth-order  finite 
difference  methods.  The  time  discretization  was  the  classical  fourth- 
order  Runge-Kutta  method.  In  all  cases  the  time-step  was  chosen  so 
small  that  the  temporal  discretization  error  was  negligible.  Because 
the  solution  is  infinitely  smooth*  the  convergence  of  the  spectral 
method  on  this  problem  is  more  rapid  than  any  finite  power  of  1/N. 
(The  error  for  the  N  =  64  spectral  result  is  so  small  that  it  is 
swamped  by  the  round-off  error  of  these  single  precision  CDC  Cyber  175 
calculations.)  In  most  practical  applications  the  benefit  of  the 
spectral  method  is  not  the  extraordinary  accuracy  available  for  large 
N  but  rather  the  small  size  of  N  necessary  for  a  moderately  accurate 
solution. 
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Table  I-  Maximum  Error  for  a  1-D  Periodic  Problem 


N 

Truncated 

Series 

Fourier 

Spectral 

2nd-0rder 

Finite 

Difference 

4th-0rder 

Finite 

Difference 

8 

9.87  (-2) 

1.62  (-1) 

1.11  (0) 

9.62  (-1) 

16 

2.55  (-4) 

4.97  (-4) 

6.13  (-1) 

2.36  (-1) 

32 

1.05  (-11) 

1.03  (-11) 

1.99  (-1) 

2.67  (-2) 

64 

6.22  (-13) 

9.55  (-12) 

5.42  (-2) 

1.85  (-3) 

128 

1.37  (-2) 

1.18  (-4) 

Basic  Chebyshev  Spectral  Concepts.  Spectral  methods  for  non-periodic 
problems  can  also  exhibit  exponential  convergence.  A  simple  example  is 
again  provided  by  Eq.  (1)  but  now  on  the  interval  [-1,1]  with  initial 
condition  u(x,0)  and  boundary  condition  u(-l,t).  Since  this  is  not  a 
periodic  problem,  a  spectral  method  based  upon  Fourier  series  in  x 
would  exhibit  extremely  slow  convergence.  However,  rapid  convergence  as 
well  as  efficient  algorithms  can  be  attained  for  spectral  methods  based 
upon  Chebyshev  polynomials.  These  are  defined  on  [-1,1]  by 

Tfl(x)  -  cos  (n  cos  ^x) .  (13) 

The  function 


u(x,t)  =  sin  onr(x-t)  (14) 

is  one  solution  to  Eq.  (1).  It  has  the  Chebyshev  expansion 


u(x,t)  =  l  u  (t)  t  (x), 
n=0  n  n 


(15) 


where 


u  (t)  sin  (-=*  -  airt)  J  (aw) 


(16) 


with 


c  - 
n 


n  =  0 
n  >  1 


(17) 


The  truncated  series 


UN(X,t)  -  l  Un(t)  Tn(x) 
n=0 


(18) 
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converges  at  an  exponential  rate.  Note  that  this  result  holds  whether 
or  not  a  is  an  integer.  In  contrast,  the  Fourier  coefficients  of 
u(x,t)  are 


—  i  _iairt  sin  ir(a+k)  i  -iairt  sin  ir(a-k) 

*Vt;  2tt  e  ci+k  2tr  e  a-k 


(19) 


For  non-integer  ot  these  decay  extremely  slowly. 

The  change  of  variables 

x  »  cos  0,  (20) 


the  definition 


v(9,t)  =  u(cos  0,t), 
and  Eq.  (13)  reduce  Eq,  (15)  to 

CO 

v(0,t)  -  £  u  (t)  cos  nQ. 

n=0  n 


(21) 


(22) 


Thus,  the  Chebyshev  coefficients  of  u(x,t)  are  precisely  the  Fourier 
coefficients  of  v(0,t).  This  new  function  is  automatically  periodic. 
If  u(x,t)  is  infinitely  differentiable  (in  x) ,  then  v(0,t)  will  be 
infinitely  differentiable  (in  0).  Hence,  straightforward  integration- 
by-parts  arguments  lead  to  the  conclusion  that  the  Chebyshev  coeffi¬ 
cients  of  an  infinitely  differentiable  function  will  decay  exponentially 
fast.  Note  that  this  holds  regardless  of  the  boundary  conditions. 

A  Chebyshev  spectral  method  makes  use  of  the  interpolating  function 


N 

u(x)  =  l  Un  Tn(x). 
n=0 

The  standard  collocation  points  are 


Xj  -  COS  N 


j  =  0,1, ,N . 


(23) 


(24) 


Thus , 


v  *  nir  j 

uj  =  K  “n  C°S  ~  * 

n=»0 


(25) 


where  Uj  is  the  approximation  to  u(xj).  The  inverse  relation  is 
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n  =  0,1  ,• • • ,N 


(26) 


G» =  sfc  L  cj_1  uj  cos  ^  ’ 


where 

(2  j  =  0  or  N 

fl  1  <  j  <  N-l 

The  analytic  derivative  of  this  function  is 


'j 


3u 

3x 


N 

1 


n=0 


(x). 


where 


*(1) 

u 

N+l 

.(1) 

u 

N 


_  *<l) 

cn  °n 


0 

0 


*(1) 


un+2 


+  2(n+l)un+1, 


n  =  N-l,N-2,*  *  * ,0. 


(27) 


(28) 


(29) 


(See  [15]  for  the  derivation  of  this  recursion  relation.)  The  Chebyshev 
spectral  derivatives  at  the  collocation  points  are 


3  u  | 
3x  'j 


N 

l 


n=0 


*  jn 

cos  -  . 

N 


(30) 


Special  versions  of  the  FFT  may  be  used  for  evaluating  the  sums  in  Eqs. 
(26)  and  (30).  The  total  cost  for  a  Chebyshev  spectral  derivative  is 
thus  0(N  in  N)  . 


The  time-stepping  scheme  for  Eq.  (1)  must  use  the  boundary  condi¬ 
tions  to  update  uN  (at  x  =  -1)  and  the  approximate  derivatives  from 
Eq.  (30)  to  update  u.  for  j=0 , 1 , • • • ,N-1 .  Note  that  no  special 
formula  is  required  for  Jthe  derivative  at  j  -  0  (or  x  =  +1)* 

Results  pertaining  to  a  =  2.5  at  t  =  1  for  a  truncated  Chebyshev 
series,  a  Chebyshev  spectral  method,  a  Fourier  spectral  method,  and  a 
second-order  finite  difference  method  are  given  in  Table  II.  For  this 
non-periodic  problem  Fourier  spectral  methods  are  quite  inappropriate, 
but  the  Chebyshev  spectral  method  is  far  superior  to  the  finite 
difference  method. 


889 


The  Chebyshev  collocation  points  are  the  extreme  points  of  tn(x). 
Note  that  they  are  not  evenly  distributed  in  x,  but  rather  are 
clustered  near  the  endpoints.  The  smallest  mesh  size  scales  as  1/N  . 
While  this  distribution  contributes  to  the  quality  of  the  Chebyshev 
approximation  and  permits  the  use  of  the  FFT  in  evaluating  the  series, 
it  also  places  a  severe  time-step  limitation  on  explicit  methods  for 
evolution  equations. 


Table  II.  Maximum  Error  for  a  1-D  Dirichlet  Problem 


N 

Truncated 

Series 

Chebyshev 

Spectral 

Fourier 

Spectral 

Finite 

Difference 

4 

1.24 

(0) 

1.49  (0) 

1.85  (0) 

1.64  (0) 

8 

1.25 

(-1) 

6.92  (-1) 

1.92  (0) 

1.73  (0) 

16 

7.03 

(-6) 

1.50  (-4) 

2.27  (0) 

1.23  (0) 

32 

1.62 

(“13) 

3.45  (-11) 

2.28  (0) 

3.34  (-1) 

64 

1.79 

(-13) 

9.55  (-11) 

2.27  (0) 

8.44  (-2) 

Application  to  Two-dimensional,  Supersonic  Flow.  Spectral  methods  have 
recently  been  applied  successfully  to  the  nonlinear  hyperbolic  system 
of  equations  which  describes  a  two-dimensional  inviscid  gas  [16,17]. 
The  most  serious  complication  over  the  simple  model  problems  discussed 
above  occurs  when  shock  waves  are  present.  If  the  shock  occurs  in  the 
interior  of  the  domain,  then  the  truncated  series  for  the  discontinuous 
flow  variables  converges  very  slowly.  Elaborate  filtering  strategies 
appear  necessary  to  extract  useful  information  from  a  calculation  of 
such  a  situation  [17,18].  This  difficulty  disappears,  however,  when  the 
shock  occurs  at  the  boundary  of  the  domain,  as  in  shock-fitting  as 
opposed  to  shock-capturing  calculations. 

A  schematic  of  the  type  of  spectral  shock-fitted  calculations 
described  below  is  illustrated  in  Fig.  1.  At  time  t  =  0  an  infinite, 
normal  shock  at  x  =  0  separates  a  rapidly  moving,  uniform  fluid  on  the 
left  from  the  fluid  on  the  right  which  is  in  a  quiescent  state  except 
for  some  specified  fluctuation.  The  initial  conditions  are  chosen  so 
that  in  the  absence  of  any  fluctuation  the  shock  moves  uniformly  In  the 
positive  x-direction  with  a  Mach  number  (relative  to  the  fluid  on  the 
right)  denoted  by  M  .  In  the  presence  of  fluctuations  the  shock  front 
will  develop  ripples.  The  shape  of  the  shock  is  described  by  the  func¬ 
tion  xg(y,t).  The  numerical  calculations  are  used  to  determine  the 
state  of  the  fluid  in  the  region  between  the  shock  front  and  some 
suitable  left  boundary  x^(t)  and  also  to  determine  the  motion  and 
shape  of  the  shock  front  itself. 
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Figure  1  is  taken  from  a  shock/ turbulence  calculation  [19]  in  which 
the  downstream  fluctuation  is  a  plane  vorticity  wave  that  is  periodic  in 
y  with  period  y  .  Because  of  the  initial  value  nature  of  the  calcula¬ 
tion,  the  fluid  motion  behind  the  shock  is  not  periodic  in  x,  as  Fig.  1 
makes  abundantly  clear.  The  interesting  physical  domain  is  given  by 

xL(t)  s  x  s  xg(y,t) 

0  «  y  <  y%  (31) 

t  >  0. 

The  change  of  variables 

x  -  x^t) 

X  “  xg(y,t)  - TJt) 

Y  =  y/yl  (32) 

T  =  t 


produces  the  computational  domain 

0  <  X  <  1 
0  <  Y  <  1 
T  >  0. 


(33) 


The  fluid  motion  is  modeled  by  the  two-dimensional  Euler 
equations.  In  terms  of  the  computational  coordinates  these  are 


QT  +  S  Qx  +  C  Qy 


0, 


(34) 


where  Q  =  (P,u,v,S), 


B  = 


and 


U 


YX 


U 


—  X 

Y  x 

a2 

—  X  0 

Y  y 

0  0 


YX> 

0 

u 

0 


0 

0 

0 

u 


(35) 
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(36) 


The  contra variant  velocity  components  are  given  by 


U  =  X  +  uX  +  vX 

and  1  x  y 

V  -  Yt  +  uYx  +  vYy. 


(37) 


A  subscript  denotes  partial  differentiation  with  respect  to  the 
indicated  variable,  P,  a,  and  S  are  all  normalized  by  reference  con¬ 
ditions  at  downstream  infinity;  u  and  v  are  velocity  components  in 
the  x  and  y  directions,  both  scaled  by  the  characteristic  velocity 
defined  by  the  square  root  of  the  pressure-density  ratio  at  downstream 
infinity,  A  value  y  =  1.4  has  been  used. 


Let  n  denote  the  time  level  and  At  the  time  increment.  The  time 
discretization  of  Eq.  (34)  is 


Q  -  [l  -  AtLn]Qn 
Q11*1  =  J  [Qn  +  (1  -  AtL)Q]  , 


(38) 

(39) 


where  L  denotes  the  spatial  discretization  of  +  C9y*  The  solu¬ 

tion  Q  has  the  Chebyshev  -  Fourier  series  expansion 


M  N/2-1  „  .  „ 

Q(X,Y,T)  -  l  I  Q  (T)  t  (S)e2,rlqY, 
p=0  q— N/2  Pq  P 

where  £  =  2X-1 .  The  derivatives  Qx  and  Qy  are  approximated  by 

M  N/2-1 


(40) 


x  -  *  L  l  qJJ’0)(T)t  (ne2,tiqY, 
X  p=0  q  —N/2  pq  P 


Qy  =  2  I  I 

=0  q  =- 
M  N/2-1 

Q  v  =  2m  £  l 


u  u 

p=0  q=-N/2 


Q'!,1)(I)Tp(5)e2'i,Y, 


(41) 


(42) 


(1,0) 

where  Q  is  computed  from  Qna  in  a  manner  analogous  to  Eq.  (29), 

and 


*pq 


Q^1)  =  i  q  Q  . 

pq  pq 


(43) 
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As  a  general  rule  the  correct  numerical  boundary  conditions  for  a 
spectral  method  are  the  same  as  the  correct  analytical  boundary  condi¬ 
tions.  The  global  nature  of  the  approximation  avoids  the  need  for 
special  differentiation  formulae  at  boundaries.  At  the  same  time 
spectral  methods  are  quite  unforgiving  of  incorrect  boundary  condi¬ 
tions.  The  inherent  dissipation  of  these  methods  is  so  low  that 
boundary  errors  quickly  contaminate  the  entire  solution.  In  many  fluid 
dynamical  applications  the  computational  region  must  be  terminated  at 
some  finite,  artificial  boundary.  The  difficulty  at  "artificial" 
boundaries  is  that  analytically  correct,  fully  nonlinear  boundary  condi¬ 
tions  for  systems  are  seldom  known.  One  example  of  a  workable  artifi¬ 
cial  boundary  condition  for  the  Euler  equations  is  given  in  Ref*  [20]. 

The  most  critical  part  of  the  calculation  is  the  treatment  of  the 
shock  front.  The  shock-fitting  approach  used  here  is  desirable  because 
it  avoids  the  severe  post-shock  oscillations  that  plague  shock-capturing 
methods.  The  time  derivative  of  the  Rankine-Hugoniot  relations  provides 
an  equation  for  the  shock  acceleration.  This  equation  is  integrated  to 
update  the  shock  position  (see  [20]  for  details).  This  method  is  a 
generalization  of  the  finite  difference  method  developed  by  Pao  and 
Salas  [21]  for  their  study  of  the  shock/vortex  interaction. 

The  nonlinear  interaction  of  plane  waves  with  shocks  was  examined  at 
length  in  [19].  The  numerical  method  used  there  was  similar  to  the  one 
described  above  but  employed  second-order  finite  differences  in  place  of 
the  present  Chebyshev-Fourier  spectral  discretization.  Detailed 
comparisons  were  made  in  [19]  with  the  predictions  of  linear  theory 
[22].  The  linear  results  turned  out  to  be  surprisingly  robust, 
remaining  valid  at  very  low  (but  still  supersonic)  Mach  numbers  and  at 
very  high  incident  wave  amplitudes.  The  only  substantial  disagreement 
occurred  for  incident  waves  whose  wave  fronts  were  nearly  perpendicular 
to  the  shock  front.  This  type  of  shock-turbulence  interaction  is  a  use¬ 
ful  test  of  the  spectral  technique  because  the  method  can  be  calibrated 
in  the  regions  for  which  linear  theory  has  been  shown  to  be  valid. 

The  most  reliable  numerical  results  can  be  obtained  for  the  acoustic 
responses  to  acoustic  waves.  Unlike  the  vorticity  responses,  these 
require  no  differentiation  of  the  flow  variables,  thus  eliminating  one 
extra  source  of  error.  Moreover,  the  acoustic  reponse  stretches  much 
further  behind  the  shock  than  the  vorticity  response,  thus  providing 
greater  statistical  reliability,  Vorticity  response  results  are 
reported  in  [23].  The  incident  pressure  wave  is  taken  to  be 

i(k.  •  x  -  (u.t) 

p{  -  AJ  e  "  (44) 

where  k-^  *  (k-^  ^k^  y),  U)  =  Mga^  ^1  x  +  al  ^1  and  A'  the 
tude.  In  terms  of  the  incidence  angle  0^,  k^  =  (k^  cos  9^,k^  sin  9^). 
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The  linearized  transmitted  acoustic  wave  can  be  expressed  in  the  same 
manner  with  all  subscripts  changed  from  1  to  2,  The  amplification 
coefficient  for  the  transmitted  acoustic  wave  is  then  the  ratio 


AJ/AJ  .  (45) 

Figure  2  indicates  the  transmission  coefficient  extracted  from  the 
computation.  At  each  fixed  value  of  X  we  perform  a  Fourier  analysis 
in  Y  of  the  pressure.  The  Fourier  coefficient  for  q  =  1  provides 
the  amplitude  A'.  In  order  to  reduce  the  transients  that  would 
accompany  an  abrupt  start  of  the  calculation  at  full  wave  amplitude,  an 
extra  factor  of  s(t)  is  inserted  into  Eq.  (44),  where 


!3(t/tg)2  -  2(t/tg)3 
1 


0  <  t  <  t 

s 

t  >  t 

s 


(46) 


The  start-up  time  tg  is  some  multiple  (typically  V2  )  of  the  time  it 
takes  the  shock  to  encounter  one  full  wavelength  (in  the  x-direction)  of 
the  incident  wave.  The  ratio  AJ/AJ  is  plotted  in  Fig,  2  as  a  function 
of  the  mean  value  of  the  physical  coordinate  x  corresponding  to  X, 
The  start-up  time  for  this  Mach  3  case  is  tg  -  0.56,  The  average  of 
the  x-dependent  responses  between  the  start-up  interval  and  the  shock 
produces  the  computed  transmission  coefficient.  The  standard  deviation 
of  the  individual  responses  serves  as  an  error  estimate. 


The  dependence  upon  incidence  angle  of  the  acoustic  transmission  co¬ 
efficient  for  A'  -  0,001  and  Mg  =  3  waves  is  displayed  in  Fig.  3. 
As  is  discussed  in  [19],  linear  theory  is  quite  reliable  at  angles 
below,  say,  45°.  Figure  3  contains  results  from  both  spectral  and 
finite  difference  calculations.  The  finite  difference  results  were 
obtained  with  the  same  second-order  MacCormack's  method  that  was 
described  in  [19]  except  that  periodic  boundary  conditions  (rather  than 
stretching)  were  employed  in  the  y-direction.  The  finite  difference  grid 
was  64  x  16  and  these  calculations  used  a  CFL  number  of  0.70.  The 
spectral  grid  was  32  x  8, and  the  CFL  number  was  0.50.  Figure  3  shows 
that  both  methods  produce  the  same  results.  A  head-to-head  comparison  of 
both  methods  for  the  9i  “  10°  case  is  provided  in  Table  III.  The 
"exact"  value  is  taken  from  linear  theory  [22]  .  Since  the  amplitude  of 
the  incident  acoustic  wave  is  so  small,  it  should  come  as  no  surprise 
that  four  points  in  the  y-direction  suffice  for  the  spectral  calcula¬ 
tion.  Note  that  the  standard  deviations  are  substantially  smaller  for 
the  spectral  method.  These  results  suggest  that  the  spectral  method 
requires  only  half  as  many  grid  points  in  each  coordinate  direction. 
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Table  III.  Grid  Dependence  of  Acoustic  Transmission  Coefficient 


Grid 

Finite 

Difference 

Chebyshev- 
Fourier  Spectral 

16  x  4 

6.403  ±  2.652 

7.257  ±  0.587 

16  x  8 

6.427  ±  2.626 

7.257  ±  0.587 

32  x  4 

7.105  ±  0.453 

7.158  ±  0.022 

32  x  8 

7.134  ±  0.471 

7.158  ±  0.022 

32  x  16 

7.139  ±  0.497 

7.158  ±  0.022 

64  x  16 

7.163  *  0.078 

7.157  ±  0.017 

128  x  16 

7.152  -k  0.022 

"exact" 

7.156 

7.156 

III.  PARABOLIC  EQUATIONS*  The  nonlinear,  parabolic  system  formed 
by  the  incompressible,  Navier-Stokes  equations  was  the  focus  of  much  of 
the  early  development  and  application  of  spectral  methods  to  large-scale 
fluid  dynamical  problems.  Fourier  spectral  methods  have  been  the 
obvious  choice  for  the  simulation  of  homogeneous,  isotropic  turbulence 
[24].  For  shear  flows,  however,  non-periodic  boundary  conditions  are 
required.  So  far,  Chebyshev  spectral  methods  have  been  favored  for 
these  applications  [25,26,27].  Nevertheless,  Legendre  spectral  methods 
are  a  viable  alternative  and  of  late  they  have  been  attracting  some 
attention.  This  section  will  present  a  discussion  of  the  implementation 
of  Legendre  spectral  methods  and  will  then  compare  them  with  Chebyshev 
spectral  methods  for  the  one-dimensional  heat  equation.  This  section 
will  close  with  a  description  of  a  promising  semi-implicit  time-stepping 
scheme  for  the  Navier-Stokes  equations. 

Basic  Legendre  Spectral  Concepts.  A  Legendre  spectral  method  on  [-1,1] 
makes  uses  of  the  interpolating  function 

N  A. 

u(x)  =  l  un  ?n(x),  (47) 

n-0 

where  PR(x)  *s  the  Legendre  polynomial  of  degree  n.  Closed  form 
expressions  for  these  polynomials  are  well-known,  albeit  clumsy.  The 
computationally  preferred  way  to  evaluate  the  polynomials  is  through  the 
recursion  relation 


Vx)  “  1 

P1(x)  =  x 

and  for  n  >  2 

n  Pn(x)  =  (2n-l)xPn-1(x)  -  (n-1)  Pn_2(x).  (48) 
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Unlike  the  case  with  Fourier  and  Chebyshev  collocation  methods, 
there  is  no  tidy  expression  for  the  Legendre  collocation  points.  Appeal 
must  be  made  to  the  theory  of  numerical  quadrature  [28  J  *  The  presence 
of  boundary  conditions  at  both  endpoints  makes  it  desirable  to  include 
-1  and  +1  in  the  set  of  collocation  points.  Subject  to  this  con¬ 
straint,  the  most  accurate  quadrature  formula  for  the  discrete  Legendre 
coefficients  is  the  Gauss-Lobatto  rule 


where 


N 


l 

j=0 


Xq  =  +i,  XN  =  -1 

The  weights  are 


Wj  Pn^Xj^  Uj’  0  "  °»1>'**>N  (49) 

and  Xj  for  1  <  j  <  N-l  are  the  roots  of 

- - ±-_ -  ,  (50) 

J  N(lH-l)  PjJCXj) 


and 


c  - 
n 


n  -  0,1,-** ,N-1 

(51) 

n  =  N 


The  interior  collocation  points  must  be  determined  numerically.  This 
quadrature  rule  yields  the  exact  Legendre  coefficients  if  u(x)  is  any 
polynomial  of  degree  less  than  N.  Its  inverse  relation  is 


N  A 

u.  =  y  up  (x , )  i 
3  n=0  n  n  j 


(52) 


The  analytic  derivative  of  the  interpolating  function  in  Eq.  (47)  is 

N 


where 


#1  =  0 


—  =  y  u^i;  p  fx) 

3x  Ln  un  V*'* 
n=0 


(53) 


:<n 


(54) 


1  A(l)  1  "(1)  ,  ~ 

-  11  =  IIs  *  +  ll 

2n+l  n  2n+5  n+2  n+1 


n  »  N-l ,N-2 ,  •  •  • ,0. 


896 


Since  fast  transform  techniques  are  not  available  for  the  Legendre 
basis  functions,  there  is  no  particular  advantage  to  computing  3u/3x| 
by  applying  Eqs.  (49),  (54)  and  (53)  rather  than  by  following  Eq.  (49) 
with 


3u  | 

^  'j 


N 


l 

n=0 


u 

n 


P'(x  .). 
nv  3 


(55) 


In  fact,  for  a  collocation  method  it  is  faster  still  to  perform  this 
entire  process  by  a  single  matrix-vector  multiplication.  For  that 
matter  the  Chebyshev  collocation  differentiation  operator  may  also  be 
represented  by  a  matrix.  Timing  studies  [29]  on  the  CDC  Cyber  175  have 
indicated  that  even  for  N  =  16,  the  Chebyshev  matrix-multiply  differen¬ 
tiation  procedure  is  substantially  faster  than  one  based  on  assembly 
language  fast  transforms.  Moreover,  the  matrix-multiply  procedure  does 
not  suffer  the  sort  of  speed  degradation  that  afflicts  the  transform 
procedure  whenever  N  is  not  an  integral  power  of  2. 


The  heat  equation 


dn  a2u 


(56) 


is  the  natural  parabolic  linear  model  problem.  The  spatial  domain  Is 
[-1,1],  the  initial  condition  is 

u(x,0)  -  sin  ttx  (57) 

and  the  boundary  conditions  are 


u(— 1 , t)  =  0 
u(+l,t)  =  0. 


(58) 


The  exact  solution  is  then 

-w2  t 

u(x,0)  =  e  sin  itx.  (59) 

The  time  differencing  is  again  the  classical  fourth-order  Runge-Kutta 
scheme. 


In  addition  to  spectral  collocation  and  series  truncation  solutions, 
we  will  also  present  spectral  tau  results.  Let  u^t)  for  n=0,l,***,N 
denote  the  Legendre  coefficients  of  the  tau  approximation  to  u(x,t). 
The  semi-discrete  tau  equations  are 


du 

n 

dt 


n  =  0, !,*•■*  ,N-2  (60) 
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with 


n 

i 


n=0 


n  even 


(61) 


N 

u  =  0. 
n 

n  odd 

The  Legendre  coefficients  of  the  approximation  to  the  second  spatial 
derivative  u£2)(t)  can  be  obtained  from  un(t)  by  two  applications  of 
the  recursion  relation  in  Eq.  (54).  In  this  tau  approximation  the 
dynamical  equations  for  the  two  highest-order  coefficients  are  dropped 
in  favor  of  the  equations  for  the  boundary  conditions.  Equation  (61) 
follows  from  the  property 

Pn(±l)  =  (±l)n.  (62) 

Since  the  Chebyshev  polynomials  also  satisfy  Eq.  (62),  the  Chebyshev 
tau  equations  for  Eq.  (56)  are  the  same  as  Eqs.  (60)  and  (61).  Of 
course,  Eq.  (29)  is  invoked  for  the  derivative  coefficients  instead  of 
Eq.  (54). 


The  results  at  t  =  1  are  given  in  Tables  IV  and  V.  The  maximum 
errors  shown  there  have  been  boosted  up  by  the  factor  e^2  so  that  they 
represent  relative  errors.  On  the  whole  the  collocation  results  are  the 
best.  Moreover,  except  for  the  truncated  series  results,  the  Legendre 
approximations  are  superior  to  the  Chebyshev  ones.  Lanczos  [30]  has 
discussed  some  circumstances  under  which  Legendre  approximations  are 
superior  to  Chebyshev  ones.  It  goes  almost  without  saying  that  finite 
difference  results  are  far  inferior  to  any  of  these  spectral  approxima¬ 
tions  , 


Table  IV.  Maximum  Error  for  Legendre  Approximations  to  the 
Beat  Equation 


N 

Truncated  Series 

Tau 

Collocation 

8 

6.65 

(-4) 

6.85 

(-4) 

2.40 

(-5) 

10 

1.72 

(-5) 

1.07 

(-5) 

1.50 

(-7) 

12 

3.06 

(-7) 

1.54 

(-7) 

1.38 

(-9) 

14 

3.50 

(-9) 

1.86 

(-9) 

4.81 

(-10) 

16 

3.88 

(-11) 

1,15 

(-10) 

9.98 

(-11) 
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Table  V.  Maximum  Error  for  Chebyshev  Approximations  to  the 
Heat  Equation 


N 

Truncated  Series 

Tau 

Collocation 

8 

2.44 

(-4) 

1.61 

(-3) 

4.58 

(-4) 

10 

5.76 

(-6) 

2.12 

(-5) 

8.25 

(-6) 

12 

9.42 

(-8) 

3.19 

(-7) 

1.01 

(-7) 

14 

1.14 

(-9) 

3.35 

(-9) 

1,10 

(’9) 

16 

1.05 

(-11) 

8.39 

(-11) 

2.09 

(-11) 

The  time-step  restriction  for  explicit  Legendre  or  Chebyshev  methods 
for  the  heat  equation  is  very  severe,  scaling  as  1/N4.  This  can  pose 
quite  a  barrier  to  large-scale  calculations  for  which  a  relative  accura¬ 
cy  of  0.1%  or  so  will  suffice.  Fortunately,  many  large-scale  calcula¬ 
tions  can  be  split  into  one-dimensional,  inhomogeneous  counterparts  of 
Eq.  (56)  and  efficient  implicit  schemes  are  available  for  this  linear, 
constant  coefficient  equation.  They  rely  on  reducing  the  Legendre  (or 
Chebyshev)  tau  equations  to  a  system  which  is  nearly  tridiagonal.  The 
Legendre  tau  equations  for  a  Crank-Nicolson  temporal  discretization  of 
Eq.  (56)  are 

.  2Xe  _  Xe  ,  . 

X  —  r  i  _  n+2  i  —  t  n+4  — 

(2n-l)(2n-3)  un-2  L  (2n-l)(2n+3) J  Un  (2n+3)(2n+5)  Un+2 

1 _ y _ i!»« _ j  + _ hi* _ ? 

(2n*l) (2n-3)  n-2  (2n-l)  (2rH-3)  n  (  2rri-3  )  (  2n+5  )  xn+2 

n  =  2,3 ,• • • ,N,  (63) 


where  X  =  -At/2  with  At  the  time-step,  the  coefficients  u  on  the 
left-hand  side  are  at  t  +  At, 


f  =  u  (t)  +  \  it  u<2)(t), 
n  n  z  n 

and 

0  *  n  <  N 
n  >  N 


(64) 


(65) 


Equation  (63)  for  even  n  plus  the  first  of  Eqs.  (61)  from  a  linear 
system  which  is  tridiagonal  except  for  the  boundary  condition 
equation.  This  is  cheap  to  invert.  The  odd  coefficients  display  a 
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similar  structure.  The  Chebyshev  tau  version  of  Eq.  (63)  is  available 
in  [15]  and  [31]. 

Application  to  Channel  Flow.  Several  three-dimensional  Navier-Stokes 
algorithms  have  been  developed  which  incorporate  the  quasi-tridiagonal 
structure  of  the  Chebyshev  tau  equations  for  the  second  derivative  in 
semi-implicit  schemes  which  treat  the  constant  coefficient  diffusion 
term  implicitly  [25-27].  In  practice  this  device  has  permitted  time- 
steps  several  orders  of  magnitude  larger  than  the  explicit  diffusion 
limit.  Unfortunately,  the  quasi-tridiagonal  structure  is  lost  even  for 
a  linear,  variable  viscosity  coefficient.  An  effective  iterative  scheme 
for  this  more  general  case  has  recently  been  proposed  by  Malik,  et  al. 
[32],  This  approach  will  be  described  here  in  its  two-dimensional 
setting. 

The  rotation  form  equations  for  two-dimensional  channel  flow  are 
ut  -  v(vx-  uy)  +  Px  =  (yux)x  +  (uuy)y 

vt  +  «(▼*■  Uy)  +  Py  ”  (yvx^x  +  (>jvy)y  (66) 

ux  +  Vy  =  0, 

with  periodic  boundary  conditions  in  x  and  no-slip  boundary  conditions 
at  y  =  ±1.  The  variable  P  denotes  the  total  pressure.  The 
viscosity  y  is  presumed  to  depend  upon  y. 

A  useful  discretization  employs  Fourier  series  in  x  and  Chebyshev 
series  in  y.  The  pressure  gradient  term  and  the  incompressibility  con¬ 
straint  are  best  handled  implicitly.  So,  too,  are  the  vertical  diffu¬ 
sion  terms  because  of  the  fine  mesh-spacing  near  the  wall.  The  variable 
viscosity  prevents  the  standard  Poisson  equation  for  the  pressure  from 
decoupling  from  the  velocities  in  the  diffusion  term.  The  algorithm 
described  in  [26]  appears  to  be  a  good  starting  point*  A  Crank-Nicolson 
approach  is  used  for  the  implicit  terms  and  Adams-Bashforth  for  the 
remainder.  After  a  Fourier  transform  In  x,  the  equations  for  each 
wavenumber  k  have  the  following  implicit  structure 


u  —  V2  At (yUy)y  +V2^tikP  = 


v  -  V2  At(y  Vy  )y  +  V2  A  tP  y  -  (67) 

iku  +  vy  =  0. 

Fourier  transformed  variables  are  denoted  by  hats,  the  subscript  y 
denotes  a  Chebyshev  spectral  derivative,  and  At  is  the  time  increment. 
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The  algorithm  in  [26]  was  devised  for  constant  viscosity,  in  which 
case  the  Eqs.  (67)  can  be  reduced  to  essentially  a  block-tridiagonal 
form.  This  cannot  be  done  in  the  present,  more  general  situation.  We 
advocate  solving  these  equations  iteratively  after  applying  a  finite 
difference  pre-conditioning. 


The  interesting  physical  problems  have  high  Reynolds  number,  i.e., 
low  viscosity*  Thus  the  first  derivative  terms  in  Eqs.  (67)  predomi¬ 
nate.  The  effective  pre-conditioning  of  them  is  crucial.  Four  possi¬ 
bilities  have  been  considered.  The  eigenvalues  of  pre-conditioned 
iterations  for  the  model  scalar  problem  ux  •  f  with  periodic  boundary 
conditions  on  [0 , 2tt  ]  are  given  for  each  possibility  in  Table  VI.  The 
term  aAx  is  the  product  of  a  wavenumber  ot  and  the  grid  spacing  Ax. 
It  falls  in  the  range  0  <  |aAx|  <  it.  For  the  staggered  grid  case  the 
discrete  Eqs.  (67)  are  modified  so  that  the  velocities  and  the  momentum 
equations  are  defined  at  the  cell  faces  yj  -  cos(irj/N),  j=0,l,--*,N, 
whereas  the  pressure  and  the  continuity  equation  are  defined  at  the  cell 
centers  y j  _  ly  =  cos(tt  (  j- V2  )/N) ,  j=l ,  •  *  •  ,N.  Fast  cosine  transforms 


enable  interpolation  between  cell  faces  and  cell  centers  to  be 
implemented  efficiently.  The  staggered  grid  for  the  Navier-Stokes 
equations  has  the  advantage  that  no  artificial  boundary  condition  is 
required  for  the  pressure  at  the  walls. 


Table  VI.  Pre-conditioned  Eigenvalues  for  a  One-dimensional 
First  Derivative  Model  Problem 


PRE-CONDITIONING 


EIGENVALUES 


Central  Differences 


aAx 


sin  (aAx) 


One-sided  Differences 


-i(aAx/2)  aAx/ 2 

sin(  (aAx) / 2) J 


High  Mode  Cut-off 


aAx 


sin(aAx) 
0 


0  S  I aAx|  s  (2tt/3) 
(2tt/3)  <  laAx!  <  tt 


Staggered  Grid 


(aAx)/2 

sin[’(aAxT/2y 
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The  actual  eigenvalues  for  pre-conditioned  iterations  of  Eqs.  (67) 
are  displayed  in  Fig.  4.  The  model  problem  estimates  the  eigenvalue 
trends  surprisingly  well  considering  that  it  is  just  a  scalar  equation, 
has  only  first  derivative  terms  and  uses  Fourier  series  rather  than 
Chebyshev  polynomials. 

The  preceding  results  indicate  that  the  staggered  grid  leads  to  the 
most  effective  treatment  of  the  first  derivative  terms.  The  condition 
number  of  the  pre-conditioned  system  is  reasonably  small  and  no  resolu¬ 
tion  is  lost  by  a  high  mode  cut-off.  (Although  it  is  possible  to  devise 
a  high-mode  cut-off  which  avoids  the  small  eigenvalues  shown  in  the 
figures,  some  of  the  spectral  resolution  is  thereby  lost.)  A  simple  and 
effective  iterative  scheme  for  this  system  with  its  complex  eigenvalues 
is  a  minimum  residual  method  [32].  At  a  Reynolds  number  of  7500  each 
iteration  reduces  the  residual  by  almost  an  order  of  magnitude. 

Table  VII,  which  is  taken  from  [32],  presents  a  comparison  of  the 
accuracy  of  the  Chebyshev  discretization  In  y.  The  two  codes  are 
otherwise  identical.  The  Initial  condition  consisted  of  Poiseuille  flow 
plus  a  small  amount  of  a  linearly  unstable  eigenmode.  The  table 
compares  the  computed  growth  rate  of  this  perturbation  with  the  theoret¬ 
ical,  linear  result  after  100  time-steps. 


Table  VII.  Percent  Error  in  Growth  Rate 


N 

Finite  Difference 

Spectral 

8 

4470 

3210 

16 

337 

74.5 

32 

147 

0.097 

64 

39.5 

0.071 

128 

10.0 

256 

2.4 

IV.  ELLIPTIC  EQUATIONS.  Fruitful  nonlinear  applications  of 
spectral  methods  developed  the  latest  for  equations  of  elliptic  type. 
Unlike  hyperbolic  or  parabolic  equations,  for  which  explicit  schemes  can 
often  be  tolerated,  ellipic  equations  virtually  require  implicit  itera¬ 
tive  schemes  in  practical  situations.  It  was  only  a  few  years  ago  that 
Morchoisne  [33]  and  Orszag  [34]  proposed  preconditioning  the  spectral 
collocation  equations  by  finite  difference  operators.  More  recently 
still,  effective  spectral  multigrid  iterative  methods  have  been 
developed  [35,36]  and  applied  to  the  nonlinear  potential  flow  problem  of 
fluid  dynamics  [29].  These  developments  will  be  described  in  this 
section. 
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Poisson's  Equation.  As 
model  problem,  but  this 
the  Poisson's  equation 


usual  the  discussion  will  begin  with  a  linear 
time  in  two  spatial  dimensions.  That  problem  is 


=  f 


(68) 


on  the  square  [-1,1]  x  [-1,1]  with  homogeneous  Dirichlet  boundary  con¬ 
ditions.  The  choice 


f(x,y)  =  -2tt2 


sin  ttx  sin  iry 


(69) 


corresponds  to  the  analytical  solution 


u(x,y)  =  sin  ttx  sin  Try.  (70) 

Both  Chebyshev  and  Legendre  spectral  methods  are  appropriate  for 
this  problem.  Direct  solution  schemes  for  the  Chebyshev  tau  method  have 
been  described  in  [31] .  The  same  schemes  also  work  for  the  Legendre  tau 
method  with  straightforward  modifications.  They  are  basically  of  an 
alternating  direction  implicit  (ADI)  nature  and  rely  on  the  quasi¬ 
tridiagonal  form  of  the  constant  coefficient,  one-dimensional  problem. 
Haidvogel  and  Zang  [31]  report  comparisons  of  the  Chebyshev  tau  method 
with  finite  difference  methods  on  numerous  problems.  They  discuss  both 
computational  efficiency  and  accuracy. 

These  direct  solution  schemes  cannot  feasibly  be  extended  to 
spectral  collocation  methods  because  the  collocation  equations  for  the 
one-dimensional  components  cannot  be  represented  by  sparse  matrices. 
However,  an  ADI  iterative  scheme  based  on  finite  difference  precondi¬ 
tioning  is  an  efficient  method  for  obtaining  an  approximate  solution. 
The  description  of  this  scheme  in  its  general  nonlinear  setting  begins 
by  writing  the  spectral  collocation  equations  as 


M(U)  =  0.  (71) 

Define  the  Jacobian 

J(U)-||(0>.  (72) 

In  many  cases  the  Jacobian  can  be  split  into  the  sum  of  two  operators 
JX(U)  and  J  (U),  each  involving  derivatives  in  only  the  one  coordinate 
direction  indicated  by  the  subscript.  The  most  straightforward  ADI 
method  is 


[al  -  Jx(V)][aI  -  Jy(V)]iV  =  aM(V),  (73) 

with  the  approximate  solution  V  updated  by 


V  V  +  WAV. 


(74) 
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This  is  just  the  Douglas-Gunn  version  of  ADI  [37].  The  term  approximate 

factorization  is  commonly  used  for  this  type  of  scheme  for  the  nonlinear 

potential  flow  problem  [38].  This  particular  scheme  is  referred  to  as 

AF1.  For  second-order  spatial  discretizations  the  term  [al  -  JX(V)] 

leads  to  a  set  of  tridiagonal  systems,  one  for  each  value  of  y.  The 

second  left-hand  side  factor  produces  another  set  of  tridiagonal 

systems.  For  spectral  discretizations,  however,  these  systems  are  full; 

hence,  Eq.  (73)  is  still  relatively  expensive  to  invert.  A  compromise  is 

to  replace  Jx  and  Jy  with  their  second-order  finite  difference 

analogs,  denoted  by  H  *  and  H  ,  respectively: 

x  y 

[al  -  H  (V) ] [al  -  H  (V)]iV  =  aM(V).  (75) 

x  y 

The  spectral  approximate  factorization  scheme  consists  of  Eqs.  (74)  and 
(75).  The  choice  of  the  iteration  parameters  is  discussed  in  [29]. 


Table  VIII.  Maximum  Error  for  Chebyshev  Approximations  to 
Poisson's  Equation 


N 

Truncated  Series 

Tau 

Collocation 

8 

2.88 

(-4) 

2.79 

(-3) 

1.17 

(-4) 

10 

6.79 

(-6) 

5.26 

(-5) 

2.33 

(-6) 

12 

1.09 

(-7) 

8.86 

(-7) 

3.12 

(-8) 

14 

1.34 

(-9) 

1.09 

(-8) 

3.27 

(-10) 

16 

1.19 

(-11) 

9.15 

(-11) 

2.73 

(-12) 

The  results  for  the  simple  model  problem  are  presented  in  Tables 
VIII  and  IX,  The  trends  are  the  same  as  they  were  for  the  heat 
equation:  the  collocation  method  is  more  accurate  than  tau  and  Legendre 
polynomials  are  more  accurate  than  Chebyshev.  (Since  it  is  not  practi¬ 
cal  to  design  a  spectral  method  for  PDE's  using  truncated  series,  those 
results  have  been  ignored  in  this  comparison.) 


Table  IX.  Maximum  Error  for  Legendre  Approximations  to 
Poisson's  Equation 


N 

Truncated  Series 

Tau 

Collocation 

8 

6.04 

(-4) 

1.55 

(-3) 

1.77 

(-5) 

10 

1.69 

(-5) 

3.40 

(-5) 

2.48 

(-7) 

12 

3.05 

(-7) 

6.05 

(-7) 

2.27 

(-9) 

14 

3.82 

(-9) 

6.98 

(-9) 

1.99 

(-11) 

16 

3.85 

(-11) 

6.37 

(-11) 

3.06 

(-10) 
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Spectral  Multigrid  Methods.  Iterative  schemes  for  spectral  collocation 
equations,  such  as  AF1,  can  be  accelerated  dramatically  by  applying 
multigrid  concepts.  This  technique  has  been  extensively  developed  for 
finite  difference  and  finite  element  discretizations  [40]  and  has 
recently  been  applied  to  spectral  discretizations  [35,36,29].  Briefly 
put,  multigrid  methods  take  advantage  of  a  property  shared  by  a  wide 
variety  of  relaxation  schemes  -  potential  efficient  reduction  of  the 
high-frequency  error  components  but  unavoidable  slow  reduction  of  the 
low-frequency  components. 


The  fundamentals  of  spectral  multigrid  are  perhaps  easiest  to  grasp 
for  the  simple  model  problem 


d2u 

dx2 


=  f 


(76) 


on  [0,2tt]  with  periodic  boundary  conditions.  The  Fourier  approximation 
to  the  left-hand  side  of  Eq.  (76)  at  the  collocation  points  is 


p« 


N/2-1 

I 


-N/2+1 


2  “ 

P  Ur 


ipx, 
e  J 


(77) 


The  spectral  approximation  to  Eq .  (76)  may  be  expressed  as 


LU  -  F, 


where 


U  ~  (VV-"  *UN-1^ * 

F  -  (fQ,f1,*-*,fN_1), 


(78) 

(79) 

(80) 


2  2 

and  L  represents  the  Fourier  spectral  approximation  to  -  d  /dx  . 

A  Richardson's  iterative  scheme  for  solving  Eq.  (78)  is 

V  <-  V  +  w(F  -  LV),  (81) 

where  m  is  a  relaxation  parameter.  On  the  right  side  of  the  replace¬ 
ment  symbol  («-)  V  represents  the  current  approximation  to  U,  and  on  the 
left  it  represents  the  updated  approximation.  The  eigenfunctions  of  L 
are 

5j(p)  =  e2llijp/N>  (82) 

with  the  corresponding  eigenvalues 

X(p)  =  p2,  (83) 
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where  j  -  0,1,***,N-1  and  p  =  -  N/2+1  ,*  *  •  ,N/2-l .  The  index  p  has 
a  natural  interpretation  as  the  frequency  of  the  eigenfunction. 

The  error  at  any  stage  of  the  iterative  process  is  V  -  U;  it  can  be 
resolved  into  an  expansion  in  the  eigenvectors  of  L.  Each  iteration 
reduces  the  p'th  error  component  to  v(A  )  times  its  previous  value, 
where  ^ 


v(X)  -  1  -  wA,  (84) 

The  optimal  choice  of  oj  results  from  minimizing  | v  (X )  I  for 
^  e  [^min»^max]  >  where  ^mln  =  ^  an£^  ^max  =  ^ 

(One  need  not  worry  about  the  p  =  0  eigenfunction  since  it  corresponds 
to  the  mean  level  of  the  solution,  which  is  at  one's  disposal  for  this 
problem,)  The  optimal  relaxation  parameter  for  this  single-grid  proce¬ 
dure  is 


2 

oG  A  +  A  . 
max  min 


(85) 


It  produces  the  spectral  radius 


A  -  A  . 

_  max  min 

SG  '  A - +~TT~  * 

max  min 

Unfortunately,  pgG  =  1  -  8/N2  ,  which  implies  that 
required  to  achieve  convergence. 


(86) 

0(N2)  iterations  are 


This  slow  convergence  is  the  outcome  of  balancing  the  damping  of 
the  lowest-frequency  eigenfunction  with  that  of  the  highest-f requency 
one  in  the  minimax  problem  described  after  Eq.  (84).  The  multigrid  ap¬ 
proach  takes  advantage  of  the  fact  that  the  low-frequency  modes  (|p|  < 
N/4)  can  be  represented  just  as  well  on  coarser  grids.  It  settles  for 
balancing  the  middle-frequency  eigenfunction  (|p|  =  N/4)  with  the 
highest-f requency  one  (|p|  =  N/2),  and  hence  damps  effectively  only 
those  modes  which  cannot  be  resolved  on  coarser  grids.  In  Eqs.  (85)  and 
(86),  Am^n  is  replaced  with  Am^d  =  A  (N/4)  .  The  optimal  relaxation 
parameter  in  this  context  is 


2 

^MG  A  +  A  .j 
max  mid 

The  multigrid  smoothing  factor 


max  mid 

yMG  "  A  +  A 

max  mid 


(87) 


(88) 
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measures  the  damping  rate  of  the  high-frequency  modes.  In  this  example 
^MG  “  0*60,  independent  of  N.  The  price  of  this  effective  damping  of 
the  high-frequency  errors  is  that  the  low-frequency  errors  are  hardly 
damped  at  all.  Table  X  compares  the  single-grid  and  multigrid  damping 
factors  for  N  “  64.  However,  on  a  grid  with  N/2  collocation  points, 
the  modes  for  I p I  e  [n/8,  N/4]  are  now  the  high-frequency  ones.  They 
get  damped  on  this  grid.  Still  coarser  grids  can  be  used  until  relaxa¬ 
tions  are  so  cheap  that  one  can  afford  to  damp  all  the  remaining  modes, 
or  even  to  solve  the  discrete  equations  exactly.  For  the  case  illustra¬ 
ted  in  Table  X  the  high-frequency  error  reduction  in  the  multigrid  con¬ 
text  is  roughly  250  times  as  fast  as  the  single-grid  reduction  for  N  =» 
64. 


Let  us  consider  just  the  interplay  between  two  grids.  A  general, 
nonlinear  fine-grid  problem  can  be  written 

Lf(Uf)  =  Ff.  (89) 

The  shift  to  the  coarse  grid  occurs  after  the  fine-grid  approximation 
vf  has  been  sufficiently  smoothed  by  the  relaxation  process,  i.e., 
after  the  high-frequency  content  of  the  error  Vf  -  Uf  has  been 
sufficiently  reduced.  The  related  coarse-grid  problem  is 

LC(UC)  =  Fc,  (90) 

where 

Fc  =  R(V  -  ITCV1)]  +  LC(RV  ).  (91) 

The  restriction  operator  R  interpolates  a  function  from  the  fine  grid 
to  the  coarse  grid.  The  coarse-grid  operator  and  solution  are  denoted 
by  Lc  and  Uc,  respectively.  After  an  adequate  approximation  Vc  to 
the  coarse-grid  problem  has  been  obtained,  the  fine-grid  approximation 
is  corrected  via 


Vf  «-  Vf  +  P(VC  -  RVf ) .  (92) 

The  prolongation  operator  P  interpolates  a  function  from  the  coarse 
grid  to  the  fine  grid. 

A  complete  nultigrid  algorithm  requires  specific  choices  of  the 
interpolation  operators,  the  coarse-grid  operators,  and  the  relaxation 
schemes.  These  issues  are  discussed  at  length  in  [35,36,29]  for  both 
Fourier  and  Chebyshev  multigrid  methods.  Numerous  linear,  variable  co¬ 
efficient  examples  are  also  provided  there.  The  more  interesting  non¬ 
linear  examples  from  [29]  are  the  subject  of  the  remainder  of  this 
paper. 


907 


Table  X.  Damping  Factors  for  N  =  64 


p 

Single-Grid 

Multigrid 

1 

.9980 

.9984 

2 

.9922 

.9938 

4 

.9688 

.9750 

8 

.8751 

.9000 

12 

.7190 

.7750 

16 

.5005 

.6000 

20 

.2195 

.3750 

24 

.1239 

.1000 

28 

.5298 

.2250 

32 

.9980 

.6000 

Application  to  Two-Dimensional  Potential  Flow.  Until  the  recent  work  of 
Streett  [39],  the  discretization  procedures  for  the  potential  equation 
were  invariably  based  on  low-order  finite  difference  or  finite  element 
methods*  Streett  used  a  spectral  discretization  of  the  full  potential 
equation  and  obtained  its  solution  by  a  single-grid  iterative  tech¬ 
nique.  The  application  of  spectral  multigrid  techniques  by  Streett,  et 
al.  [29]  produced  a  dramatic  acceleration  of  the  iterative  scheme.  Even 
in  its  relatively  primitive  state  the  spectral  multigrid  scheme  is 
competitive,  and  in  some  cases  unequivocally  more  efficient,  than 
standard  finite  difference  schemes. 

After  a  conformal  mapping  from  the  surface  of  an  airfoil  to  a 
circle  the  potential  equation  becomes 

<«) 

where  G  is  the  reduced  potential,  R  and  0  are  the  computational 
polar  coordinates,  and  p  is  the  fluid  density.  The  reduced  potential 
is  periodic  in  ©  and  it  satisfies 

at  R  =  1,  (94) 

as  R  *  00  ,  (95) 


G  >  0 


and  the  Kutta  condition.  The  density  is  given  by  the  isentropic  rela¬ 
tion 
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p 


(96) 


the  ratio  of  specific  heats  is  denoted  by  y,  and  M^  is  the  Mach  number 
at  infinity.  The  velocity  components  in  the  physical  (r,9)  plane  are 


_ 13# 

qr  “  H  3R 

(97) 

1  3# 

qe  "  rh  30  * 


and  the  Jacobian  between  the  complex  physical  plane  (z  »  reJ‘°)  and  the 
complex  computational  plane  (o  ■  Re*®)  is 


H  - 


(98) 


Further  details  are  provided  in  [39]. 

The  spectral  method  employs  a  Fourier  series  representation  in  0. 
Constant  grid  spacing  in  0  corresponds  to  a  convenient  dense  spacing 
in  the  physical  plane  at  the  leading  and  trailing  edges.  The  domain  in 
R  (with  a  large,  but  finite  outer  cutoff)  is  mapped  onto  the  standard 
Chebyshev  domain  [-1,1]  by  an  analytical  stretching  transformation  that 
clusters  the  collocation  points  near  the  airfoil  surface.  The  stretch¬ 
ing  is  so  severe  that  the  ratio  of  the  largest-to-smallest  radial 
intervals  is  typically  greater  than  1000. 

The  flow  past  an  NACA  0012  airfoil  at  4°  angle  of  attack  and  a 
frees tream  Mach  number  of  0.5  is  a  challenging  subsonic  and  thus  ellip¬ 
tic  case.  Nevertheless,  the  spectral  solution  on  a  relatively  coarse 
grid  captures  all  the  essential  details  of  the  flow.  The  surface  pres¬ 
sure  coefficient  from  the  spectral  code  MGAFSP  [29]  using  16  points  in 
the  radial  (R)  direction,  and  32  points  in  the  azimuthal  (0)  direction 
is  displayed  in  Fig.  5.  The  symbols  denote  the  solution  at  the  colloca¬ 
tion  points.  For  comparison,  the  result  from  the  finite  difference, 
multigrid,  approximate  factorization  code  FL036  [41]  is  shown  as  a  solid 
line.  The  grid  used  in  the  benchmark  finite  difference  calculation  is 
so  fine  (64  x  384  points)  that  the  truncation  error  is  well  below 
plotting  accuracy.  The  FL036  and  MGAFSP  results  are  identical  to 
plotting  accuracy.  The  spectral  computation  on  this  mesh  yields  a  lift 
coefficient  with  truncation  error  less  than  10”  .  Spectral  solutions 
on  a  16  x  32  grid  are  thus  of  more  than  adequate  resolution  and 
accuracy  for  subsonic  flows. 
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In  Fig.  6  are  shown  convergence  histories  from  FL036,  MGAFSP,  and 
the  finite  difference,  approximate  factorization,  single-grid  code  TAIR 
[42] .  Meshes  which  yield  approximately  equivalent  accuracy  were 
chosen.  The  surface  pressure  results  are  the  same  to  plotting  accuracy, 
the  lift  coefficient  is  converged  in  the  third  decimal  place,  and  the 
predicted  drag  coefficient  is  less  than  .001.  (Actually,  the  spectral 
result  is  an  order  of  magnitude  more  accurate  than  these  limits,  but  the 
TAIR  result  barely  meets  them.)  Figure  7  demonstrates  the  improvement 
produced  by  the  spectral  multigrid  scheme  over  the  spectral  single-grid 
method  (AFSP) .  There  is  well  over  an  or der-of -magnitude  gain  in 
efficiency. 

V.  A  MIXED  EQUATION.  The  potential  flow  problem  is  much  more 
difficult  whenever  the  flow  field  contains  both  supersonic  (hyperbolic) 
and  subsonic  (elliptic)  regions.  Nevertheless,  the  spectral  multigrid 
algorithm  that  succeeded  for  the  subsonic  flow  case  requires  only  a 
minor  modification  in  order  to  succeed  for  the  transonic  (mixed)  problem 
as  well. 

The  most  expedient  technique  for  dealing  with  the  mixed  elliptic- 
hyperbolic  nature  of  the  transonic  problem  is  to  use  the  artificial 
density  approach  of  Hafez,  et  al.  [43].  The  original  artificial  density 
is  ^  «- 

p  -  p  -  y<5p  (99) 

with 

\i  -  max{0,l  -  iy}  ,  (100) 

VT 

where  M  is  the  local  Mach  number  and  5p  is  an  upwind  first-order 
(undivided)  difference.  The  spectral  calculations  employed  a  higher- 
order  artificial  density  formula.  The  spectral  method  also  required  a 
weak  filtering  technique  to  deal  with  some  high-frequency  oscillations 
generated  by  the  shock.  Details  are  available  in  [39]. 

Flow  Past  an  Airfoil.  A  lifting  transonic  case  Is  provided  by  the  NACA 
0012  airfoil  at  M^  =  0.75  and  2°  angle  of  attack.  A  shock  appears 
only  on  the  upper  surface  for  these  conditions  and  is  rather  strong  for 
a  potential  calculation;  the  normal  Mach  number  ahead  of  the  shock  is 
about  1.36.  Lifting  transonic  cases  are  especially  difficult  for 
spectral  methods  since  the  solution  will  always  have  significant  content 
in  the  entire  frequency  spectrum:  the  shock  populates  the  highest 

frequencies  of  the  grid  and  the  lift  Is  predominantly  on  the  scale  of 
the  entire  domain.  An  iterative  scheme  therefore  must  be  able  to  damp 
error  components  across  the  spectrum. 

Surface  pressure  distributions  from  MGAFSP,  TAIR,  and  FL036  are 
shown  in  Fig.  8.  The  respective  computational  grids  are  18  x  64, 

30  x  149,  and  32  *  192.  The  latter  two  are  the  default  grids  for  the 
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production  finite  difference  codes.  Spectral  results  obtained  by  trigo¬ 
nometrically  interpolating  the  18  x  64  grid  results  onto  a  much  finer 
grid  are  included  alongside  the  results  at  the  collocation  points.  This 
reveals  the  wealth  of  detail  that  is  provided  by  the  rather  coarse  spec¬ 
tral  grid.  The  shock  predicted  by  TAIR  is  far  more  rounded  and  smeared 
than  that  of  FL036,  reflecting  the  coarser  mesh  and  larger  artificial 
viscosity  used  in  the  former.  The  TAIR  result  shown  is  also  only 
correct  to  one  decimal  place  in  lift  as  compared  with  a  finer-grid 
result.  Convergence  histories  for  these  three  cases  are  shown  in  Fig.  9 
along  with  the  results  for  MGAFSP  on  a  coarser  grid  (16  x  48). 

Flow  Fast  a  Circular  Cylinder.  The  MGAFSP  code  has  recently  been  used 
for  an  extremely  accurate  determination  of  the  critical  freestream  Mach 
number  at  which  the  potential  flow  past  a  circular  cylinder  first 
develops  a  supersonic  region  [44],  This  spectral  calculation  represents 
an  alternative  to  the  asymptotic  series  method  employed  by  van  Dyke  and 
Guttmann[45]  to  arrive  at  the  estimate  Mcrit  =  .39823780  ±  ,00000001. 

The  spectral  multigrid  potential  code  was  used  to  determine  the 
critical  Mach  number  on  several  grids.  On  each  of  these  grids  calcula¬ 
tions  were  performed  at  a  half-dozen  or  so  freestream  Mach  numbers.  For 
each  case  the  maximum  local  Mach  number  was  determined  from  the  computed 
solution.  Then  an  extrapolation  procedure  was  employed  to  ascertain 
what  freestream  Mach  number  produced  a  maximum  local  Mach  number  of 
unity.  This  value  was  recorded  as  the  critical  Mach  number  for  that 
particular  grid.  An  estimate  of  the  extrapolation  error  was  made  to 
ensure  consistency.  These  results  are  given  in  Table  XX. 

Finally,  these  grid-dependent  calculations  of  the  critical  free¬ 
stream  Mach  number  were  extrapolated  to  the  limit  of  infinite  numerical 
resolution.  The  best  result  was  obtained  by  assuming  sixth-order 
convergence.  The  final  estimate  of  the  critical  freestream  Mach  number 
is  Mcrit  =  .3982415  ±  ,0000002.  The  difference  between  this  estimate 
and  the  one  by  van  Dyke  and  Guttmann  is  more  than  an  order-of-magnitude 
greater  than  the  estimated  errors.  Possible  explanations  for  this 
discrepancy  are  discussed  in  [44],  Nevertheless,  the  agreement  of  the 
two  estimates  to  better  than  one  part  in  10*  is  remarkable  in  itself. 

Note  that  the  convergence  rate  of  the  spectral  multigrid  potential 
result  (at  least  sixth-order)  pertains  to  a  quantity  (critical  free¬ 
stream  Mach  number)  which  requires  the  fundamental  solution  (the 
potential)  to  be  first  differentiated  and  then  extrapolated.  Moreover, 
the  MGAFSP  code  is  so  efficient  that  all  of  the  requisite  calculations 
consumed  less  than  20  minutes  of  CPU  time  on  the  CDC  Cyber  175  and  were 
performed  on  grids  with  no  more  than  2000  points. 

A  comparable  calculation  by  existing  finite  difference  codes  would 
likely  exhibit  only  first-order  convergence.  It  would  be  far  more 
expensive  both  in  terms  of  CPU  time  and  storage,  surely  exceeding  the 
central  memory  of  a  machine  such  as  the  CDC  Cyber  175.  Here  then  is  an 
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example  which  firmly  establishes  the  utility  of  spectral  methods  for 
nonlinear,  multi-dimensional  problems. 


Table  XX.  Grid-dependent  Critical  Freestreaa  Mach  Numbers 


Grid 

Mcrit 

Error  Estimate 

14  x  32 

.398289 

.000048 

18  x  40 

.3982514 

.0000099 

22  x  48 

.3982450 

.0000035 

30  x  64 

.3982422 

.0000007 

Research  of~the  first  author  was  supported  by  NASA  Contract  Nos.  NAS1- 
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ACOUSTIC  RESPONSE  TO  10° 
INCIDENT  ACOUSTIC  WAVE 

Mach  3 


Post-shock  dependence  of 
the  pressure  resonse  to  a 
pressure  wave  incident  at 
10  to  a  Mach  3  shock. 

The  solid  line  is  the 
linear  theory  prediction. 
The  circles  are  the 
spectral  solition. 


Figure  2 


ACOUSTIC  TRANSMISSION  COEFFICIENT 
FOR  INCIDENT  ACOUSTIC  WAVES 
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Dependence  on  incident  angle 
of  the  pressure  responce  to 
a  0.1%  amplitude  pressure 
wave  incident  on  a  Mach  3 
shock.  The  solid  line  is  the 
linear  theory  result. 

Circles  are  spectral 
solutions;  squares  are  finite 
difference  solutions. 


Figure  3 
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Figure  4 


Eigenvalues  of  the  pre-conditioned  matrices 
for  the  semi-implicite  channel  flow  when  the 
streamwise  wave  number  k  =  5.  The  grid  is 
32  x  17,  the  Reynolds  number  is  7500  and  the 
CFL  number  is  0.10.  Note  the  different  scale 
used  for  the  central  differences  pre¬ 
conditioning  results. 


Maximum  residual  versus  machine  time  foe  a  subsonic  flow 
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Error  in  lift  versus  machine  time  for  a  subsonic  flow  from 
single-grid  (AFSP)  and  raultigrid  (MGFSP)  spectral  schemes. 
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Surface  pressures  for  a  transonic  flow. 
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Figure  9 

Maximum  residual  versus  machine  time  for  a  transonic  flow. 
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